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Preface 


In the introduction to the first volume of The Arithmetic of Elliptic Curves 
(Springer-Verlag, 1986), I observed that “the theory of elliptic curves is 
rich, varied, and amazingly vast,” and as a consequence, “many important 
topics had to be omitted.” I included a brief introduction to ten additional 
topics as an appendix to the first volume, with the tacit understanding that 
eventually there might be a second volume containing the details. You are 
now holding that second volume. 

Unfortunately, it turned out that even those ten topics would not fit 
into a single book, so I was forced to make some choices. The following 
material is covered in this book: 


I. Elliptic and modular functions for the full modular group. 
II. Elliptic curves with complex multiplication. 
III. Elliptic surfaces and specialization theorems. 
IV. Néron models, Kodaira-Néron classification of special fibers, 
Tate’s algorithm, and Ogg’s conductor-discriminant formula. 
V. Tate’s theory of g-curves over p-adic fields. 
VI. Néron’s theory of canonical local height functions. 


So what’s still missing? First and foremost is the theory of modular 
curves of higher level and the associated modular parametrizations of ellip- 
tic curves. There is little question that this is currently the hottest topic 
in the theory of elliptic curves, but any adequate treatment would seem to 
require (at least) an entire book of its own. (For a nice introduction, see 
Knapp [1].) Other topics that I have left out in order to keep this book 
at a manageable size include the description of the image of the é-adic 
representation attached to an elliptic curve and local and global duality 
theory. Thus, at best, this book covers approximately half of the material 
described in the appendix to the first volume. I apologize to those who may 
feel disappointed, either at the incompleteness or at the choice of particular 
topics. 

In addition to the complete areas which have been omitted, there are 
several topics which might have been naturally included if space had been 
available. These include a description of Iwasawa theory in Chapter II, 
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the analytic theory of p-adic functions (rigid analysis) in Chapter V, and 
Arakelov intersection theory in Chapter VI. 

It has now been almost a decade since the first volume was written. 
During that decade the already vast mathematical literature on elliptic 
curves has continued to explode, with exciting new results appearing with 
astonishing rapidity. Despite the many omissions detailed above, I am 
hopeful that this book will prove useful, both for those who want to learn 
about elliptic curves and for those who hope to advance the frontiers of our 
knowledge. I offer all of you the best of luck in your explorations! 


Computer Packages 


There are several computer packages now available for performing compu- 
tations on elliptic curves. PARI and SIMATH have many built-in elliptic 
curve functions, there are packages available for commercial programs such 
as Mathematica and Maple, and the author has written a small stand-alone 
program which runs on Macintosh computers. Listed below are addresses, 
current as of March 1994, where these packages may be acquired via anony- 
mous ftp. 


PARI (includes many elliptic curve functions) 
math.ucla.edu 128.97.4.254 
megrez.ceremab.u-bordeaux.fr 147.210.16.17 

(directory pub/pari) 
(unix, mac, msdos, amiga versions available) 

SIMATH (includes many elliptic curve functions) 
ftp.math.orst.edu 
ftp.math.uni-sb.de 

apecs (arithmetic of plane elliptic curves, Maple package) 
math.mcgill.ca 132.206.1.20 

(directory pub/apecs) 

Elliptic Curve Calculator (Mathematica package) 

Elliptic Curve Calculator (stand-alone Macintosh program) 
gauss.math.brown.edu 128.148.194.40 

(directory dist /EllipticCurve) 


A description of many of the algorithms used for doing computations on 
elliptic curves can be found in H. Cohen [1, Ch. 7] and Cremona [1]. 
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Introduction 


In the first volume of The Arithmetic of Elliptic Curves, we pre- 
sented the basic theory culminating in two fundamental global results, 
the Mordell-Weil theorem on the finite generation of the group of rational 
points and Siegel’s theorem on the finiteness of the set of integral points. 
This second volume continues our study of elliptic curves by presenting six 
important, but somewhat more specialized, topics. 

We begin in Chapter I with the theory of elliptic functions and modular 
functions for the full modular group I'(1) = SL2(Z)/{+1}. We develop this 
material in some detail, including the theory of Hecke operators and the L- 
series associated to cusp forms for (1). Chapter II is devoted to the study 
of elliptic curves with complex multiplication. The main theorem here 
states that if K/Q is a quadratic imaginary field and if E/C is an elliptic 
curve whose endomorphism ring is isomorphic to the ring of integers of K, 
then K(j(£)) is the Hilbert class field of K; and further, the maximal 
abelian extension of K is generated by j(E£) and the z-coordinates? of the 
torsion points in E(C). This is analogous to the cyclotomic theory, where 
the maximal abelian extension of Q is generated by the points of finite 
order in the multiplicative group C*. At the end of Chapter II we show 
that the L-series of an elliptic curve with complex multiplication is the 
product of two Hecke L-series with Gréssencharacter, thereby obtaining at 
one stroke the analytic continuation and functional equation. 

The common theme of Chapters III and IV is one-parameter families 
of elliptic curves. Chapter III deals with the classical geometric case, where 
the family is parametrized by a projective curve over a field of characteristic 
zero. Such families are called elliptic surfaces. Thus an elliptic surface 
consists of a curve C, a surface €, and a morphism 7: € — C such that 
almost every fiber 7~1(t) is an elliptic curve. The set of sections 


{maps o : C > € such that mo a(t) = t} 


If j(E) = 1728 or j(E) = 0, one has to use x? or x° instead of z. 
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to an elliptic surface forms a group, and we prove an analogue of the 
Mordell-Weil theorem which asserts that this group is (usually) finitely 
generated. In the latter part of Chapter III we study canonical heights 
and intersection theory on € and prove specialization theorems for both 
the canonical height and the group of sections. 

Chapter IV continues our study of one-parameter families of ellip- 
tic curves in a more general setting. We replace the base curve C' by a 
scheme S = Spec R, where R is a discrete valuation ring. The generic fiber 
of the arithmetic surface € — S is an elliptic curve EF defined over the 
fraction field K of R, and its special fiber is a curve € (possibly singular, 
reducible, or even non-reduced) defined over the residue field k of R. We 
prove that if C + S is a minimal proper regular arithmetic surface whose 
generic fiber is EF’, and if we write € for the part of @ that is smooth over S, 
then € is a group scheme over S and satisfies Néron’s universal mapping 
property. In particular, E(K) © €(R); that is, every K-rational point on 
the generic fiber F’ extends to an R-valued point of €. We also describe the 
Kodaira-Néron classification of the possible configurations for the special 
fiber C and give Tate’s algorithm for computing the special fiber. At the 
end of Chapter IV we discuss the conductor of an elliptic curve and prove 
(some cases of) Ogg’s formula relating the conductor, minimal discrimi- 
nant, and number of components of @. 

In Chapter V we return to the analytic theory of elliptic curves. We 
begin with a brief review of the theory over C, which we then use to analyze 
elliptic curves defined over R. But the main emphasis of Chapter V is on 
elliptic curves defined over p-adic fields. Every elliptic curve FE defined 
over C is analytically isomorphic to C*/q” for some q € C*. Similarly, 
Tate has shown that if E is defined over a p-adic field K and if the j- 
invariant of E is non-integral, then E is analytically isomorphic to K*/q? 
for some g € K*. (It may be necessary to replace K by a quadratic 
extension.) Further, the isomorphism E(K) & K*/q” respects the action 
of the Galois group Gx/%, a fact which is extremely important for the 
study of arithmetic questions. In Chapter V we describe Tate’s theory 
of g-curves and give some applications. 

The final chapter of this volume contains a brief exposition of the 
theory of canonical local height functions. These local heights can be used 
to decompose the global canonical height described in the first volume 
[AEC, VIII §9]. We prove the existence of canonical local heights and give 
explicit formulas for them. Local heights are useful in studying some of the 
more refined properties of the global height. 

As with the first volume, this book is meant to be an introductory text, 
albeit at an upper graduate level. For this reason we have occasionally made 
simplifying assumptions. We mention in particular that in Chapter II we 
restrict attention to elliptic curves whose ring of complex multiplications 
is integrally closed; in Chapter III we only consider elliptic surfaces over 
fields of characteristic 0; and in Chapter IV we assume that all Dedekind 
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domains and discrete valuation rings have perfect residue fields. Possibly 
it would be preferable not to make these assumptions, but we feel that the 
loss of generality is more than made up for by the concomitant clarity of 
the exposition. 


Prerequisites 


The main prerequisite for reading this book is some familiarity with the ba- 
sic theory of elliptic curves as described, for example, in the first volume. 
Beyond this, the prerequisites vary enormously from chapter to chapter. 
Chapter I requires little more than a first course in complex analysis. Chap- 
ter II uses class field theory in an essential way, so a brief summary of class 
field theory has been included in (II §3). Chapter III requires various clas- 
sical results from algebraic geometry, such as the theory of surfaces and 
the theory of divisors on varieties. As always, summaries, references, and 
examples are supplied as needed. 

Chapter IV is technically the most demanding chapter of the book. 
The reader will need some acquaintance with the theory of schemes, such 
as given in Hartshorne [1, Ch. II] or Eisenbud-Harris [1]. But beyond that, 
there are portions of Chapter IV, especially IV §6, which use advanced 
techniques and concepts from modern algebraic geometry. We have at- 
tempted to explain all of the main points, with varying degrees of precision 
and reliance on intuition, but the reader who wants to fill in every detail 
will face a non-trivial task. Finally, Chapters V and VI are basically self- 
contained, although they do refer to earlier chapters. More precisely, the 
interdependence of the chapters of this book is illustrated by the following 
guide: 


y, 


Ch. I >» |{Ch. VI Ch. III} ---» [Ch. IV 


> Ch. V ‘A 


The dashed line connecting Chapter III to Chapter IV is meant to indicate 
that although there are few explicit cross-references, mastery of the subject 
matter of Chapter III will certainly help to illuminate the more difficult 
material covered in Chapter IV. 


References and Exercises 


The first volume of The Arithmetic of Elliptic Curves (Springer-Verlag, 
1986) is denoted by [AEC], so for example [AEC, VIII.6.7] is Theorem 6.7 
in Chapter VIII of [AEC]. All other bibliographic references are given by 
the author’s name followed by a reference number in square brackets, for 
example Tate [7, theorem 5.1]. Cross-references within the same chapter 
are given by number in parentheses, such as (3.7) or (4.5a). References 
from within one chapter to another chapter or appendix are preceded by 
the appropriate Roman numeral or letter, as in (IV.6.1) or (A §3). Exercises 
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appear at the end of each chapter and are numbered consecutively, so, for 
example, exercise 4.23 is the 23"4 exercise at the end of Chapter IV. 

Just as in the first volume, numerous exercises have been included at 
the end of each chapter. The reader desiring to gain a real understanding of 
the subject is urged to attempt as many as possible. Some of these exercises 
are (special cases of) results which have appeared in the literature. A list 
of comments and citations for the exercises will be found at the end of the 
book. Exercises marked with a single asterisk are somewhat more difficult, 
and two asterisks signal an unsolved problem. 


Standard Notation 
Throughout this book, we use the symbols 


Z, Q, R, C, Fy, and Z, 


to represent the integers, rational numbers, real numbers, complex num- 
bers, field with g elements, and p-adic integers respectively. Further, if R 
is any ring, then R* denotes the group of invertible elements of R; and if A 
is an abelian group, then A[m] denotes the subgroup of A consisting of all 
elements with order dividing m. A more complete list of notation will be 
found at the end of the book. 


CHAPTER I 


Elliptic and Modular Functions 


In most of our previous work in [AEC], the major theorems have been of 
the form “Let E/K be an elliptic curve. Then E/K has such-and-such 
a property.” In this chapter we will change our perspective and consider 
the set of elliptic curves as a whole. We will take the collection of all 
(isomorphism classes of) elliptic curves and make it into an algebraic curve, 
a so-called modular curve. Then by studying functions and differential 
forms on this modular curve, we will be able to make deductions about 
elliptic curves. Further, the Fourier coefficients of these modular functions 
and modular forms turn out to be extremely interesting in their own right, 
especially from a number-theoretic viewpoint. We will be able to prove 
some of their properties in the last part of the chapter. 

This chapter thus has two main themes, each of which provides a 
paradigm for major areas of current research in number theory and alge- 
braic geometry. First, when studying a collection of algebraic varieties or 
algebraic structures, one can often match the objects being studied (up 
to isomorphism) with the points of some other algebraic variety, called a 
moduli space. Then one can use techniques from algebraic geometry to 
study the moduli space as a variety and thereby deduce facts about the 
original collection of objects. A subtheme of this first main theme is that 
the moduli space itself need not be a projective variety, so a first task is to 
find a “natural” way to complete the moduli space. 

Our second theme centers around the properties of functions and dif- 
ferential forms on a moduli space. Using techniques from algebraic geom- 
etry and complex analysis, one studies the dimensions of these spaces of 
modular functions and forms and also gives explicit Laurent, Fourier, and 
product expansions. Next one uses the geometry of the objects to define 
linear operators (called Hecke operators) on the space of modular forms, 
and one shows that the Hecke operators satisfy certain relations. One then 
takes a modular form which is a eigenfunction for the Hecke operators 
and deduces that the Fourier coefficients of the modular form satisfy the 
same relations. Finally, one reinterprets all of these results by associating 
an L-series to a modular form and showing that the L-series has an Euler 
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product expansion and analytic continuation and that it satisfies a func- 
tional equation. 


81. The Modular Group 


Recall [AEC VI.3.6] that a lattice A C C defines an elliptic curve E/C via 
the complex analytic map 
C/A — En(C) : y* = 42° — gox — 93 
zt (9(2; A), @'(z;A)). 


(z;A) = +d (—- =) 


weEA 
wH40 


is the Weierstrass g-function relative to the lattice A. (See [AEC VI,§3].) 
Further, if Ay and Ag are two lattices, then we have 


Here 


Ea, =/c En, if and only if A; and A» are homothetic. 


(See [AEC VI.4.1.1]. Recall Ay and Ag are homothetic if there is a num- 
ber c € C* such that A; = cAg.) 

Thus the set of elliptic curves over C is intimately related to the set 
of lattices in C, which we denote by L: 


£ = {lattices in C}. 
We let C* act on & by multiplication, 
cA = {cw: w € A}. 


Then the above discussion may be summarized by saying that there is an 
injection 
{elliptic curves defined over Cy 

C-isomorphism 


LIC 


According to the Uniformization Theorem for Elliptic Curves (stated 
but not proven in [AEC VI.5.1]), this map is a bijection. One of our goals 
in this chapter is to prove this fact (4.3). But first we will need to describe 
the set £/C* more precisely. We will put a complex structure on £/C*, 
and ultimately we will show that £/C* is isomorphic to C. 

Let A € £. We can describe A by choosing a basis, say 


A = Zu, + Zu. 


Switching w, and wy if necessary, we always assume that the pair (we, w1) 
gives a positive orientation. (That is, the angle from w2 to w is positive 
and between 0° and 180°. See Figure 1.1.) 
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An Oriented Basis for the Lattice A 
Figure 1.1 


Since we only care about A up to homothety, we can normalize our 
basis by looking instead at 


1 
a eee a 
We W92 


Our choice of orientation implies that the imaginary part of w;/w satisfies 
Im(w1/w2) > 0, 
which suggests looking at the upper half-plane 
H= {7 eC: Im(r) > 0}. 
We have just shown that the natural map 


H— L/C*, 
TRO A, =Zr4+Z 


is surjective. It is not, however, injective. When do two 7’s give the same 
lattice? We start with an easy calculation. 
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Lemma 1.1. Let a,b,c,d€R,7€C,7 ¢R. Then 


at +b (ad — bc) Im(r) 
m = ; 
cer+d ler + dl? 


PrRooF. Let +r = s+it. Multiplying numerator and denominator by c7 +d, 
we find 


ar+b__ {acl7|? + (ad + bc)s + bd} + {(ad — be)t}i 
cT+d ler + dl? , 


O 
The ambiguity in associating a 7 € H to a lattice A lies in choosing an 
oriented basis for A. Suppose that we take two oriented bases, 


A = Zu, + Zw = Zw}, + Zw}. 
Then there are integers a,b, c,d, a’, b’,c’,d’ so that 


wy = aw, + bw, wy =a'w, + 0'w5, 
t een Z foot 
Wy = cw + dw, we = CW, +dwy. 


Substituting the left-hand expressions into the right-hand ones and using 
the fact that w,; and we are R-linearly independent, we see that 


a b a’ 1 0 
c d c dd 0 1)" 
Further, using Lemma 1.1 (with 7 = w;/w2) and the fact that our 
bases are oriented, we find that 


ee (4) es (= + =) = (ad — bc) Im(w1/we) _ 


we cw, + dwe |c(w 1 /w2) + dl? 


and so 
ad — be > 0. 


In other words, the matrix G 4) is in the special linear group over Z, 


& i) € SLo(Z) = 16 i) £048,746 € 2,08 — By =1}. 


This proves the first half of the following lemma. 
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Lemma 1.2. (a) Let AC C bea lattice, and let w1,w2 and w},w be two 
oriented bases for A. Then 


wy = aw, + bw2 


eee for some matrix C 2) € SLo(Z). 


(b) Let 7,72 € H. Then A,, is homothetic to A,, if and only if there is a 


matrix 
aT, +b 


ct, +d 


c 7 € SL2(Z) such that T= 


(c) Let Ac C be a lattice. Then there is at € H such that A is homothetic 
toA, =Z7r4+Z. 


PRooF. (a) This was done above. 
(b) Using (a), we find that 
A,, is homothetic to A,, 
<> Zr. +Z=ZatT,+Za for some a € C*, 
To = aaT, + ba ab 
SLo(Z 
an ee Teen & aye 2(Z), 
am +b 


=> 72 = F 
cm) +d 


Conversely, if T2 = (aT; +b) i (ct, +d), let a = cr; +d. Then again using (a), 
we find 
ahr, = Z(a7, + 6) + Z(er, +d) =Z7, + Z=A,,. 


Hence A,, and A,, are homothetic. 
(c) Write A = w1Z + wZ with an oriented basis and take rT = w/w. 
O 


In view of Lemma 1.2(b), it is natural to define an action of SL2(Z) 
on H as follows: 


aT+b ab 
= f = SL2(Z) and H. 
ace Sone or ¥ (ta)e 2(Z) and 7 € 
The fact that 7 is in H follows from Lemma 1.1, and the fact that this de- 
fines a group action is an easy calculation. This action gives an equivalence 
relation on the points of H, and Lemma 1.2(b) tells us what the cosets are. 
There is a bijection 


SLA(Z)\H OP eC 


T i? Ne: 


We can actually do a little bit better, since the matrix 


-=(9 4) 


acts trivially on H. 
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Definition. The modular group, denoted I'(1), is the quotient group 
T(1) = SLo(Z)/{+1}. 


Although I'(1) is the quotient SL2(Z)/{+1}, we will generally just 
write down matrices and leave it to the reader to remember that (0 *,) 


is equal to é ?). For an explanation of the notation I'(1), see exercise 1.6 


where we define groups I'(V) for all integers N > 1. 


Remark 1.3. Note that +1 are the only elements of SL2(Z) which fix H. 
For suppose that y = G 2) satisfies yr = 7 for all 7 € H. This means 


that 
cr? —(d—a)r —b=0 for all 7 € H, 


from which we conclude that c = b= 0 and a = d. Hence y = +1. 


Remark 1.4. The group ['(1) contains two particularly important ele- 
ments, which we will denote 


0 -1 1 1 
oh ae Se): 
Their action on H is given by 


S(r) = +, T(r) =7H+1. 


Notice also that the elements S and ST = (? 7) have finite order, 


2 3 
2 {0 -1l\ _ 3 {0 -1\ _ 
Ss oa ) =1 and (ST) se 1 = 1; 
so ['(1) contains finite subgroups of order 2 and 3. 


The next proposition provides us with a good description of the quo- 
tient space I'(1)\H. 


Proposition 1.5. Let $ CH be the set 
F={reEH:|r|>1 and |Re(r)| < 5}. 


(See Figure 1.2 for a picture of ¥ and some of its translates by elements 
of I\(1).) 

(a) Let r © H. Then there is a y € T'(1) such that yr € F. 

(b) Suppose that both t and yr are in F¥ for some y € (1), y #1. Then 
one of the following is true: 
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¥F and Some of Its ['(1)-Translates 


Figure 1.2 


(i) Re(7)=—5 and yr=rT+l,; 
(ii) Re(r)=45 0 and yr=r-4; 
(iii) |jrj=1 and yr =—l/r. 


(c) Let rt € F, and let 


Tate EL) yr ry 
be the stabilizer of r. Then 


{1,5} ifr =i; 

jad heh (ST) ) atrSpeet: 
{1,T7S,(TS)?} ifr =—p= erry 
{1} otherwise. 
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PROOF. (a) We prove something stronger. Let I’ be the subgroup of I'(1) 
generated by S = (? 0) and T = (4 t); and let r € H. We will prove 
that there is a y € I’ such that yr € F. 

For any 7 = G ) €T(1), Lemma 1.1 says that 


Iin(yr) = en. 


Write 7 = s + it. Since t > 0, it is clear that 
ler + dl? = (es +d)? + (ct)? + 00 as |e|+|d| — oo. 


Hence, for our fixed 7, there is a matrix yo € I’ which maximizes the 
quantity Im(yoT). Next, since T”r = 7 + n, we can choose an integer n so 
that 

|Re(T”yor)| < 5. 


We set y = Ty and claim that y7 € F. 
Suppose to the contrary that yr ¢ ¥. By construction, |Re(yr)| < 5, 
so we must have |yT| < 1. But then 
Im(yr 
Im(Sy7) = aa > Im(yr) = Im(707), 
contradicting the choice of yor to maximize Im(7yo7). This contradiction 
shows that yr € ¥, which completes the proof of (a). 


(b,c) We may assume that Im(yr) > Im(r), since otherwise we replace 


the pair 7,7 by the pair yr,y~1(yr). Writing y = a 3) as usual, we 


have 


Im(r) < Im(yr) = mr so |er+d| <1. 


Since Im(r) > 4V3, we must have |c| < 2/V3, so |c| < 1. Replacing + 
by —7 if necessary, it suffices to consider the cases c = 0 and c = 1. 


Then a =d=1 and yr =7T +b. Since 
|Re(r)| <4 and | Re(yr)| = |Re(7 + 8)| < $, 
it follows that 
b=+1 and Re(r) = ¥3-. 
c=1 
By assumption, |7| > 1 and |r +-d| <1. Writing 7 = s +7t, this means that 
1<s?+#2 and (s+)? +t? <1; 


so 
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1<s?+t? <1-2ds —d? =1—d(d+1) —d(2s #1). 


Since d € Z, the quantity d(d + 1) is non-negative. Similarly since |s| < 3, 
the quantity d(2s = 1) is non-negative for one of the choices of +/— sign. 
We conclude that 


Fear +P S21 and d(2s + d) = 0. 


We now look at several subcases. 


Then y = (t 0): and since |7| = 1, we have 
4 > |Re(yr)| =|Re(a —77")| = la — |. 
Hence one of the following three cases holds: 
; |r| =1, y=S, yt = -1/r; 


’ T=-p, Y= TS, y(-P) = —£; 
a=-l, s=-}, T=p, y=(STY?’, 7p = p. 


Then 7 = p and y = (7 *7 1), so 
y=a-——~=artp. 
p 
Since y7 € J, this leads to two cases: 


a=0, y=ST, YP = Pp; 


c=1d=-1,s=} 


Then 7 = —p, 7 = (t ea 2. and yT = a+7, so just as in the previous 
case there are two possibilities: 


a=0, y=(TS), (2) =-7 


a=-1, y=(q 4), -A) =e. 
O 
The geometric description of the quotient space ['(1)\H provided by 
Proposition 1.5 can be used to give a quick proof of the following purely 
algebraic fact. 
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Corollary 1.6. The modular group (1) is generated by the matrices 
0 -1l 1 1 
s-(? a and r=(4 ae 
ProorF. As in the proof of Proposition 1.5(a), we let I’ be the subgroup 
of (1) generated by S and T. Fix some 7 in the interior F, such as T = 2i. 
Let y € P'(1). From the proof of (1.5a) there is a y’ € I’ such that y/(yr) € 


FJ. Thus 7 is in the interior of F, and (y'y)r is in F¥. We conclude from (1.5b) 
that y7/ = 1. Therefore y = 7/~! € I’, which proves that I’ = I'(1). Oo 


Remark 1.6.1. It is in fact true that ['(1) is the free product of its sub- 
groups (S) and (ST) of orders 2 and 3. See exercise 1.1. 


§2. The Modular Curve X (1) 


The quotient space ['(1)\H classifies the set of lattices in C up to homoth- 
ety. Proposition 1.5 provides a nice geometric description of T'(1)\H. The 
vertical sides of the fundamental domain F are identified by T, and the 
two arcs of the circle |r| = 1 are identified by S, as shown in Figure 1.3. 
Making these identifications, we see that as a topological space, I'(1)\H 
looks like a 2-sphere with one point missing. Our next tasks are to supply 
that missing point, define a topology, and make the resulting surface into 
a Riemann surface. 

Rather than adding a single point to ['(1)\H, we will give a more gen- 
eral construction which is useful for generalizing the results of this chapter. 


Definition. The extended upper half-plane H* is the union of the upper 
half-plane H and the Q-rational points of the projective line, 


H* = HUP!(Q) = HU QU {oo}. 


One should think of P!(Q) as consisting of the rational points on the real 
axis together with a point at infinity. The points in P(Q) are called the 
cusps of H*. 


There is a natural action of [(1) on P!(Q) defined by 


a b z|  |ax+by 
c dj) \y| |ext+dy|- 
(Here we use [*] to denote homogeneous coordinates for a point in P'(Q).) 
Thus I'(1) acts on the extended upper half-plane H*. We define 
Y(1) =[(1)\H and X(1) =1(1)\H*. 


The points in the complement X(1) \ Y(1) are called the cusps of X(1). 
We now show that X(1) has only one cusp and calculate its stabilizer. 
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topological 


The Geometry of I'(1)\H 
Figure 1.3 


Lemma 2.1. (a) 
X(1) \ Y(1) = {oo}. 


(b) The stabilizer in T(1) of 00 € H* is 


I(coo) = {(6 ) € ry} = (the subgroup of I'(1) generated by T). 


PROOF. (a) Let (ol € P!(Q) be any point in H* \H. Since x and y are 


homogeneous coordinates, we may assume that x,y € Z and ged(z,y) = 1. 
Choose a,b € Z so that ax + by = 1. Then 


(5 eno om of)-L 


Therefore every point in H* \ H is equivalent (under the action of I'(1)) 


ee have & 3) ‘a — [0 if and only if c = 0. Hence (3 4) has the 


form ae : oO 


Topologically, X(1) looks like a 2-sphere. To make this precise, we 
need to describe a topology on X(1). We start by giving a topology for H*. 


16 I. Elliptic and Modular Functions 


A Neighborhood of so 


Some Open Sets in H* 
Figure 1.4 


Definition. The topology of H* is defined as follows. For rt € H, we take 
the usual open neighborhoods of 7 contained in H. For the cusp oo, we 
take as a basis of open neighborhoods the sets 


{rt €H: Im(r) > &} U {co} for every Kk > 0. 
For a cusp T # co, we take as a basis of open neighborhoods the sets 
{the interior of a circle in H tangent to the real axis at 7} U {r}. 


(See Figure 1.4.) 


Remark 2.2.1. For any cusp 7 4 co, Lemma 2.1(a) says that there is 
a transformation y € I'(1) with yoo = 79. Then one easily checks that y 
sends a set of the form {Im(r) > «} to the interior of a circle in H tangent 
to the real axis at 7). (See exercise 1.2.) In other words, the fundamental 
neighborhoods of oo and of the finite cusps are sent one-to-another by the 
elements of (1). 


Remark 2.2.2. From the definition, it is clear that distinct points of H* 
have disjoint neighborhoods. Hence H* is a Hausdorff space. It is also clear 
from (2.2.1) that the elements of ['(1) define homeomorphisms of H*. 


The next lemma will help us describe the topology on the quotient 
space X(1) = T(1)\H*. It will also be used later to define a complex 
structure on X(1). 
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Lemma 2.3. For any two points 71,72 € H"*, let 
I(7, 72) = fy EP (1): y71 = to}, 
and similarly, for any two subsets U,,U2 C H*, let 
I(U,, U2) = {y € TA): WU NU, FO}. 


Then, for all 71,72 € H*, there exist open neighborhoods U;,U2 © H* 
of T,,T2 respectively such that 


(Ui, U2) = I(71,72). 


(In other words, if yU, and Uz have a point in common, then necessar- 
ily yT1 = T2.) 


PRroor. For any a, @ € T'(1) we have 
I(at1, Bt2) = BI(m1,T2)a7* and I(aU,, BU2) = BI(U,,U2)a7}. 


It thus suffices to prove the lemma for any ['(1)-translates of 7, and 79. 
Using (1.5a) and (2.la), we may assume that 


71,72 € F* = FU {co}. 


From (1.5) and (2.1), we have a good description of how I'(1) acts on H* 
and 3*, as illustrated in Figure 1.2. We consider three cases, depending on 
whether or not our points are at oo. 


From (1.5) (or Figure 1.2) we see that I(¥,F) is finite; explicitly, 
HES SAL TTS ATSTAESY SST SES AST YT ty. 
Let 


G = Interior( UJ as) 
EM FF) 


Then G is an open subset of H containing ¥. Further, (3, 9) is finite, since 


18,9) Lo inFnA= UO wl, F)7t. 
ys y2El(F,F) y1y2EL(F,F) 


Next we observe that if y € (9,9) \ I(m,72), so ym # 72, then we 
can find open sets V,, W, in H satisfying 


YT1 Ee V,, Tz € Wy, and V,NW, =90. 
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Let 
aso. (a Me teas “Cy We 
yel(S,9) yEI(S,S) 
yEI(71,72) EI (71,72) 


By construction, 7 € U; and 72 € U2, so 
I(71,72) C I(U,, U2). 
Suppose that they are not equal, say y € [(U;,U2) \ I(™,72). Then 
y € 1(G,9) \ I(74, 72), and so y € I(y71 Vy, Wy) = I(Vy, W)y. 


But V, AW, = 0, so 1 ¢ I(V,,W,). This contradiction shows the other 
inclusion and completes the proof that I(7, 72) = I(Ui, U2). 


Let U,; be an open disk centered at 7,. As in the proof of Proposition 1.5, 
we observe that the quantity 


Im(r) 
&=k(U,;)= sup Im(yr)= — su mae 
( 1) al (¥ ) ae ler + dl? 
yer) ¢ byerq) 


is finite. (Note that if 7 = s + it € Ui, then s and ¢ are bounded, so 
jer+d|? = (es+d)?+(ct)? + 00 as |e|+|d|— oo uniformly int € U;.) 


Now 
Up = {7 © H: Im(r) > kK} U {co} 


will be a neighborhood of oo satisfying 
yU,NU2 =9 for all y € T(1). 


Hence 
(U,, U2) = i) = I(11, 72). 
Let 


Uso = {7 €H: Im(r) > 2} U {oo}. 
From (1.5) (or Figure 1.2) we see that the only elements of ['(1) which 


take some point in U,, to another point in Uj are powers of T. Hence 
from (2.1b) we conclude that 


I(Us0, Uso) = {T* € T(1) : k € Z} = I(co, 00). 
Oo 
Next we define a topology on X(1) and use Lemma 2.3 to show that 
X(1) is a Hausdorff space. Note that this fact requires proof; it is not 
immediate from the fact that H* is Hausdorff. (See exercise 1.3.) 


§2. The Modular Curve X(1) 19 


Definition. Let 
@:H* —+T(1)\H* = X(1) 


be the natural projection. The quotient topology on X(1) is defined by the 
condition that U C X(1) is open if and only if ¢~!(U) is open. Equivalently, 
it is the weakest topology for which ¢ is continuous. Note that ¢ is also an 
open map, that is, it takes open sets to open sets. For if W C H* is open, 
then so is 
¢ (dw) = LU we. 
yer(1) 

Proposition 2.4. X(1) with its quotient topology is a compact Hausdorff 
space. 


PROOF. We start by checking that X(1) is compact. Let {U;}:er be an 
open cover of X(1). Then {o"(Ui) bier is an open cover of H*. In par- 
ticular, some ¢~!(U;) contains 00, say oo € ¢~!(U;,). By definition of the 
topology on H*, there is a constant « > 0 so that 


@ 1(Ui,) D {7 CH: Im(r) > &} U {oo}. 


Hence the set F \ ¢~1(U;,) is compact (it is closed and bounded), so there 
is a finite subcover 


FN o (Ui) c o*(Uiz) Less Ug *(Ui,)- 


Then U;, U---UU;,, covers X(1). 

Next we verify that X(1) is Hausdorff. Let 21,22 € X(1) be distinct 
points, and let 71,72 € H* be points with ¢(7;) = z;. Then yr, 4 72 for 
all y € T'(1), so in the notation of (2.3), I(71,72) = @. From (2.3), there 
are open neighborhoods U;,U2 C H* of 71, 72 satisfying I(U;,U2) = @. 
Then $(U1), ¢(U2) are disjoint neighborhoods of 21, x2. oO 


Making X(1) into a compact Hausdorff space is a good start, but recall 
that our ultimate goal is to give X(1) a complex structure. We recall what 
this means. 


Definition. Let X be a topological space. A complex structure on X is 
an open covering {U;}:;e, of X and homeomorphisms 


HU; —> pi(Ui) CC 
such that each #;(U;) is an open subset of C and such that for all i,7 € I 
with U,; NU; 4 0, the map 
thy OW, : W(Ui Uj) — ¥j(Ui NU;) 


is holomorphic. The map y; is called a local parameter for the points in U;. 
A Riemann surface is a connected Hausdorff space which has a complex 
structure defined on it. 
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Theorem 2.5. The following defines a complex structure on X(1) which 
gives it the structure of a compact Riemann surface of genus 0: 

Let x € X(1), choose 7, € H* with $(7,) = x, and let U, C H* bea 
neighborhood of 7, satisfying 


I(Uz, Uz) = I (tz). 


(Such a U, exists from Lemma 2.3 with 7, = T2 = T, and U, = U; NUz2.) 
Then 


I(t_)\Uz C X(1) 


is a neighborhood of x, so {I(tz)\Uz} 


Let r = #1 (Tz), and let g, be the holomorphic isomorphism 


reX(1) 18 aM open cover of X(1). 


TOT, 
:H— C: |z| < 1}, 2(T) = ——. 
gH {zeCi ll <1}, gel) = 2 


Then the map 
te: I(T:)\Uz — C, Wa (d(r)) = gx(T)" 


is well defined and gives a local parameter at x. 
r= oo 


We may take tT, = 00, so I(tz) = {T*}. Then 


bee =e, velolr) = {Gein Foe 


is well defined and gives a local parameter at zx. 
Remark 2.5.1. If [(7,) = {1}, then the natural map 
b:U, > I(t)\Uz C X(1) 
is already a homeomorphism, so 
Qe = $7* :1(t)\Uz — Us 


is a local parameter at x. Thus the only real complication occurs when z 
equals ¢(2), 6(p), or (00). (See also exercise 1.4.) 
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Remark 2.5.2. The following commutative diagrams illustrate the defi- 
nitions of the local parameters wz : I(tz)\Uz © C. 


Ui 2s - Tes Us > AGO 
|= |v. Goo \ |v 
Ges. Cc Cc 
T— Tex TUT 
x # oO, Gx (T) = ayn I= OH, Joo (T) = e? 


PROOF (of Theorem 2.5). We already know that X(1) is a compact Haus- 
dorff space (2.4), and it is clearly connected due to the continuous surjec- 
tion @: H* > X(1). Further, an inspection of Figure 1.2 shows that X(1) 
has genus 0. (For those who dislike such a visual argument, we will later 
give an explicit map j : X(1) — P!(C). See (4.1) below. The interested 
reader can check that our proof that 7 is analytic does not depend on the 
a priori knowledge that X(1) has genus 0. Then the elementary argument 
described in exercise 1.11 shows that j is bijective, hence an isomorphism.) 
By construction, the set 


is a neighborhood of x. We must verify that the maps 
Wr: T(tz)\U, —2 C 


are well-defined homeomorphisms (onto their images) and that they satisfy 
the compatibility conditions for a complex structure. 

We begin with a lemma which shows that the function g,(7) behaves 
nicely with respect to the transformations in I(7,). 


Lemma 2.6. Let a € H, let R : H — H bea holomorphic map with 
R(a) =a, and let g(r) = (7 — a)/(7 —@). Suppose further that 


r times 


iN 
Ro---oR(r)=T 


and that r > 1 is the smallest integer with this property. Then there is a 
primitive r“’-root of unity ¢ such that 


g(Rt) = Cg(7T) for allt CH. 


Proor. Note that g is an isomorphism 


g:H—{zeEC: |z| <1} 
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with g(a) = 0, so the map 
G=goRog?:{zEC: |z| <1} — {z€C: |z| <1} 


is a holomorphic automorphism of the unit disk with G(0) = 0. It follows 
that G(z) = cz for some constant c € C. (See, e.g., Ahlfors [1].) Since 
the r-fold composition Go ---o G(z) = z and r is chosen minimally, we 
conclude that c is a primitive r*®-root of unity. oO 


We resume the proof of Theorem 2.5. Suppose first that 2 4 oo. Note 
that from (1.5), I(r) is cyclic, say generated by R. Then (2.6) implies that 


9x(Rr) = Cg(r) for all r € H, 
where ¢ is a primitive r*’-root of unity. Hence 
Wx (@(RT)) = ge(RT)” = C'G2(7)" = ve(O(7)), 


so Wz is well defined on the quotient I(7,)\Uz. 
Next we check that w, is injective. Let 71,72 € U;. Then 


Wx (b(t1)) = Ye (O(72)) > ge(t1)” = ge(T2)" 
<> gx(T1) = C'g2(T2) for some 0<i<r, 
=> g2(m) = gx(R't2) for some 0 <i <r, 
<> 7, = R'm for some0<i< r, 


<=> $(11) = $(72). 


Hence w, is injective. Finally, it is clear from the commutative diagram 
given in (2.5.2) that both w, and w;! are continuous, since the maps ¢, gx, 
and z +> 2” are all continuous and open. Therefore wy, is a homeomorphism. 

The case x = oo is similar. From (2.1b) we know that I(oo) = {T*} 
consists of the translations tr +> 7 +k for k € Z. Hence w,(¢(7)) = e27*7 
is well defined and injective on the quotient I(co)\U.. And, as above, wz 
and 1 are continuous, since both ¢ and rT +> e?"'™ are continuous and 
open. Hence w, is a homeomorphism. 

It remains to check compatibility. First let 2,y € X(1) with x, y 4 oo. 
Then 


Wy o wy! (z) = py 0 G0 (Wy 0 ¢)7}(z) = giy ogy! (2) 


Now gy and gz! are holomorphic, so the only possible problem would be 
the appearance of fractional powers of z. Let ¢ be the primitive r,*>-root 
of unity such that g,(R:T) = Cgx(r). Then using the fact that doy = ¢ 
for any 7 € ['(1), we find 


gh 0g. 1(C2) = Uy 060 Re ogz'(2) = dy obo gz'(2) = 9h? og5'(2): 
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It follows that g,” og; !(z) is a power series in 2"*, which proves that the 
composition wy 077 1(z) is holomorphic. (Note the importance of knowing 
that ¢ is a primitive r,*>-root of unity.) 

By exactly the same computation, taking g(r) = exp(27ir), the 
function 


boo 0 U5 1(z) = exp (2nigz (2"/"*)) 
is holomorphic. 
Finally, we note that 
Gy’ (7 +1) = yy 0 boT(r) = vy 0 O(7) = gy" (7), 
so gy’ (T) is a holomorphic function in the variable q = e?"'7. (Note T is 
restricted to Uy M Uj; it is not allowed to tend toward ioo.) Hence the 
transition map 


1 
by 0 Wey (2) = gf (= log :) 


is holomorphic. 
This completes the proof that the open sets I(7,)\Uz and the maps 


We 2 T(tr)\Uz — C 
define a complex structure on X(1). Oo 
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In the previous section we showed that the quotient space X(1) = T(1)\H* 
has the structure of a Riemann surface of genus 0. It is natural to look at 
the meromorphic functions on this Riemann surface. 


Example 3.1. Recall that to each 7 € H we have associated a lattice A; = 
Zr + Z and an elliptic curve C/A,. From Lemma 1.2(b) there is a well- 
defined map (of sets) 
T(1)\H_ — Cc 

T Pa (CAG): 
We will show later (4.1) that with the complex structure described in (2.5), 
the j function is a meromorphic function on X(1) which gives a complex 
analytic isomorphism 

gj: X(1) > P(C). 

Every meromorphic function f on X(1) is thus a rational function 
of j, that is, f € C(j). In order to have a richer source of functions, we will 
study functions on H that have “nice” transformation properties relative 
to the action of [(1) on H. Although these transformation properties may 
look somewhat artificial at first, the corresponding functions actually define 
differential forms on X(1), so they are in fact natural objects to study. 
(See (3.5) below for further details.) 
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Definition. Let k € Z, and let f(7) be a function on H. We say that f is 
weakly modular of weight 2k (for ['(1)) if the following two conditions are 
satisfied: 

(i) f is meromorphic on H; 

(ii) f(yr) = (er +d)** f(r) for all y = (2 i) €T(1),7 EH. 
Remark 3.2. Note that a function satisfying f(yr) = (cr +d)* f(r) for an 
odd integer « is necessarily the zero function, since taking y = ( ri A 
yields f(r) = —f(7). This explains why we restrict attention to even 
weights. 


Remark 3.3. Since (1.6) says that I'(1) is generated by the two matri- 
ces S = ‘ rs and T = ( 4 i , a meromorphic function f on H is weakly 
modular of weight 2k if it satisfies the two identities 


f(rt+l)=f(r) and f (=) gee fp 8 


From the first it follows that we can express f as a function of 
qa emit, 
and f will be meromorphic in the punctured disk 
{q:0< |g| < 1}. 


Thus f has a Laurent expansion 3 in the variable gq, or in other words, f 
has a Fourier expansion: 


lo <) 
f(q@= a ang” 
n=—o0o 
Definition. With notation as in (3.3), f is said to be 
x [oe 
meromorphic at co if f = > anq” for some integer no, 
n=—no 


[o.e} 
holomorphic at oo if f = Sang” 
n=0 


If f is meromorphic at oo, say f = Gnd "+--+ with a_n, # 0, then the 
order of f at co is 


ordeo(f) = ordg=o(f) = —no. 
If f is holomorphic at oo, its value at oo is defined to be 
f(co) = f(0) = a0. 


Definition. A weakly modular function that is meromorphic at oo is called 
a modular function. 
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Definition. A modular function that is everywhere holomorphic (i.e., ev- 
erywhere on H and at oo) is called a modular form. If in addition f(oo) = 0, 
then f is called a cusp form. 


Example 3.4.1. Let A be a lattice. The Eisenstein series 


Gal) = oe 


weEA 
wH0 


is absolutely convergent for all integers k > 2. (See [AEC VI.3.1].) For 7 € 
H we let 


1 
Gax(t)=Gae(Ar)= 0 ae 
m,neZ 
(m,n) A(0,0) 


By inspection, 


Gox(cA) = c7?* Gon (A) for any cE C*, 


whereas 
ar+b 1 1 
= = b d)) = ——A,. 
Nee waa t2 pg Etat + ) + Z(er + d)) aa? 
Hence 


Gax(yT) = Gar (Ayr) = Gan ((er + d)7* Ay) 
= (er + d)*Gox(Ar) = (er + d)”*Gox (7). 


Thus Go, is weakly modular of weight 2k. 


Proposition 3.4.2. Let k > 2 be an integer. The Eisenstein series Go, 
is a modular form of weight 2k. Its value at oo is given by G2,(oo) = 
2¢(2k), where C(s) is the Riemann zeta function. (For the complete Fourier 
expansion of Go, see (7.1).) 


PROOF. We have just shown that Geox is weakly modular, so it remains to 
show that Go, is holomorphic on H and at oo and to compute its value at oo. 
Note that if 7 is in the fundamental domain F described in Proposition 1.5, 
then 


[mr + n|? = m? |r? + 2mnRe(r) +n? > m? — mn +n? = |mp — nI?. 


Hence the series obtained from G2,(T) by putting in absolute values is 
dominated, term-by-term, by the series obtained from G2;(p) by putting 
in absolute values. Therefore Go, is holomorphic on F. But H is covered 
by the I'(1)-translates of F¥, and Gox(yT) = (cr + d)**Gox(T), so Gx is 
holomorphic on all of H. 
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Next we look at the behavior of Gox(T) as T — ioo. Since the series 
for Gog, converges uniformly, we can take the limit term-by-term. Terms 
of the form (mr + n)~?* with m 4 0 will tend to zero, whereas the others 
give n~2*, Hence 


CO 


F 1 
lim, Gax(7) = S aon = 2¢(2K). 
mMm=— Oo 
nO 
This shows that G2, is holomorphic at oo and gives its value. QO 


Example 3.4.3. It is customary to let 
g2(T) = 60G4 (rT) and g3(T) = 140G¢(r). 
(See [AEC VIL.3.5.1].) The (modular) discriminant is the function 
A(t) = go(7)? — 2793(r)?. 
It is a modular form of weight 12, since from (3.4.2) we know that G4(r) 


and G(r) are modular forms of weights 4 and 6 respectively. 
Using the well-known values (see (7.2) and (7.3.2)) 


an T 
(== and (6) = Se, 
we find that 
4 6 
g2(00) = 120¢(4) = Z-,—gs(oo) = 2806(6) =>, Aloo) =0. 


Hence A(r) is a cusp form of weight 12. We will see below (3.10.2) that it 
is essentially the only one. 


Remark 3.5. Let y = é i) € SL2(Z), and let dv be the usual differential 
form on H. Then 


dr = (er + d)~*dr. 


d(yr) -a(=*)) _ ad —be 


ct +d) (er +d)? 


Thus dr has “weight —2.” In particular, if f(7) is a modular function of 
weight 2k, then the k-form 
f(r) (dr) 


is [(1)-invariant. It thus defines a k-form on the quotient space I'(1)\H, at 
least away from the orbits of i and p, where the complex structure is a bit 
more complicated. 
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We will soon show that f(r) (dr)* actually defines a meromorphic k- 
form on X(1). We begin with a brief digression concerning differential 
forms on arbitrary Riemann surfaces. In particular, formula (3.6b) below 
will be crucial in our determination of the space of modular forms of a 
given weight. 


Definition. Let X/C be a smooth projective curve, or, equivalently, a 
compact Riemann surface. Recall that Qyx is the C(X)-vector space of 
differential 1-forms on X. (See [AEC II §4].) The space of (meromor- 
phic) k-forms on X is the k-fold tensor product 


OQ = OY = Ox ®erx) +++ Werxy Nx. 


OX is a 1-dimensional C(X)-vector space [AEC II.4.2a]. Notice that if we 
set OY = C(X), then @ O% has a natural structure as a graded C(X)- 
k=0 


algebra. 
Let w € O%, x € X, and choose a uniformizer t € C(X) at x. Then 


w = g(dt)* 
for some function g € C(X). We define the order of w at x to be 
ord,(w) = ord,(g). 


It is independent of the choice of t. (If ¢’ is another uniformizer, then ap- 
plying [AEC II.4.3b] we find that dt/dt’ is holomorphic and non-vanishing 
at x.) Just as with 1-forms, we define the divisor of w by 


div(w) = Y= ord, (w)(x) € Div(X); 


rex 


we say that w is regular (or holomorphic) if 
ord, (w) > 0 for alla Ee X. 


Proposition 3.6. Let X/C be a smooth projective curve of genus g, 
let k > 1 be an integer, and let w € OK. 

(a) Let Kx be a canonical divisor on X [AEC II §4]. Then div(w) is 
linearly equivalent to kKx. 


(b) 

deg(div w) = k(2g — 2). 
PROOF. (a) Let 7 € Q be a non-zero 1-form with divisor div(n) = Kx. 
Then 


F=w/n* € 0% =C(X) 
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is a function on X, so 
div(w) = k div(n) + div(w/n*) = kK x + div(F) 


is linearly equivalent to kK x. 
(b) From (a), deg(divw) = kdeg(Kx). Now apply the Riemann-Roch 
theorem [AEC II.5.4b], which says that deg(Kx) = 2g — 2. oO 


The next proposition gives the precise relationship between a modular 
function f of weight 2k and the corresponding k-form f(r) (dr)*. 


Proposition 3.7. Let f be a non-zero modular function of weight 2k. 


(a) The k-form f(r) (dr)* on H descends to give a meromorphic k-form w 


on the Riemann surface X (1). In other words, there is a k-form wy € Fa) 


such that 
b* (we) = f(r) (dr)*, 
where ¢: H > X(1) is the usual projection. 
(b) Let x € X(1), and let r, € H* with 6(Tz) = x. Then 


ord,, (f) ifx # (it), 6(p), d(x); 
Lordi(f)— 4k if = (i); 

3 ord,(f) — 2k ifx = $(p); 

orda.(f) —k if x = $(00). 


ord; (wy) = 


Remark 3.7.1. If f is a modular function, then it is easy to see that the 
order of vanishing of f at 7 € H depends only on the I'(1)-equivalence class 
of r. The point is that since f(yr) = (cr + d)?* f(r) and cr +d #0, we 
have 

ord, (f) = ord; (f 0 y~') =ord,-(f). 
Thus the expression in (3.7b) really does not depend on the choice of the 
representative Tz. 


Proor. (a) As we have seen, the k-form f(r) (dr)* is invariant for the 
action of I'(1) on H. We must show that for each x = ¢(7;) € X(1), 
the k-form f(r) (dr)* descends locally around x to a meromorphic k-form 
on X(1), and that it vanishes to the indicated order. Clearly, we will 
need to use the description of the complex structure on X(1) provided by 
Theorem 2.5. We consider two cases. 


Using the notation from (2.5), there is a commutative diagram 


Uz I(t)\Us 
[oe |v. 
C C 


Ze w=2" 
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which defines a local parameter 
bz (9(T)) = 92 (7) 
at x. We write 
T—Tr oe 


BGS) es w=2", and Pg. @y= 4 
x 


Tg BS Te 


¥ 


so w = 2” is our local parameter. 
Let R be a generator of I(r). Then from (2.6), 


gx(Rr) =Cg(r),  andso = Rr = Rogy'(z) = g7'(¢2). 


Here ¢ is some primitive r‘b-root of unity. 
Now 


f(r) (dr)* = f (95 1(2)) (doz '(z))* 
= f(95'(z))g5 (2)* (dz)* = F(z) (dz)*, 


where F(z) = f(gz'(z))gz''(z)* is a meromorphic function of z. Note 
further that since g, is a local isomorphism, we have 


ord,=7,(f) = ordz=o0(F). 


We must show that F(z) (dz)* is a meromorphic function of w = 2”. 
To do this, we use the fact (3.5) that f(r) (dr)* is T(1)-invariant. This 
implies 


F(z) (dz)* = f(r) (dr)* = f(Rr) (dRr)* 
= f(9z'(¢z)) (dgz1(¢z))" = F(¢2) (d¢z)* = F(¢2)¢* (dz)*. 


In particular, the function z*F(z) is invariant under the substitu- 
tion z+ Cz. Since ¢ is a primitive r**-root of unity, it follows that 


z* F(z) = F(z") 
for some meromorphic function F\(w). Hence 


F(z) (dz)* = r~*zkQ-7) F(z) (a(z"))* 
Seg hg E 2") (d(2"))* =r *w*F,(w) (dw)*, 


which proves that f(r) (dr)* descends to a meromorphic k-form wy in a 
neighborhood of z. 
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Finally, we compute 
ord;=,, f(T) = ordzao F(z) = ordz=o 2~* F(z") = —k + r ordy=o Fi(w); 
ord; wf = ordy=o r*w-* Fy (w) = —k + ordyao Fi (w). 
Eliminating ord, Fi(w) from these two equations yields 
ord, wf = * onde, f- ¢ — ~ | k. 
It only remains to note that from (1.5), 
Life £41), 60), 


r= (2 ifa=¢(2), 
3 ifa = (p). 


r= C 


Again using (2.5), we have a local parameter 
be i T(00)\Uxo + C, be ($(r)) =P. 


Let q = e?"'7 be the local parameter at oo, and write f(r) = f(q) as 
n (3.3). Since dr = (27iq)~1 dq, we have 


f(r) (dr)* = f(q)(2miq)~* (dq). 


By definition, f is meromorphic at q = 0, so f(r) (dr)* descends to a 
meromorphic k-form wy in a neighborhood of oo. Finally, 


ord. wf = ordg=o f(q)(20ig)~* = ordoo(f) — k- 
O 
Proposition 3.7 describes the local behavior of the k-form wy € Cay: 
The Riemann-Roch theorem, specifically Proposition 3.6(b), gives a global 
description of its degree. Combining these results, we obtain the following 
important formula. 


Corollary 3.8. Let f be a non-zero modular function of weight 2k.Then 


1 1 k 
g rdi(f) + 3 ordp(f) + ordoo(f) + > ord, (f) = ze 
7€F(1)\H* 
THi,P, CO 
(Here the sum is over any set of representatives for T'(1)\H* excluding the 
equivalence classes containing i, p, and oo.) 


ProoF. First note that from (3.7.1), the sum is independent of the choice 
of representatives for I(1)\H*. Let wp € OK (1) be the k-form corresponding 
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to f(r) (dr)* as in Proposition 3.7. By the Riemann-Roch theorem (3.6b) 
and the fact that X(1) has genus 0 (2.5), we find that 


deg(div wy) = —2k. 
On the other hand, (3.7) gives 
deg diviey) = (fords f — J) + (Bord, f ~ 34) 
+(ordeo f—k)+ $5 ord,(f). 
reP(1)\H" 
THI, P,CO 


Equating these two expressions for deg(divwy) gives the desired formula. 
oO 


Using Corollary 3.8, we can give a good description of the space of all 
modular forms of a given weight. We set the notation 


Mox = {modular forms of weight 2k for T'(1)}, 
M8, = {cusp forms of weight 2k for I'(1)}. 


Note that both Mo, and M8. are C-vector spaces. 


Example 3.9. For all k > 2, the Eisenstein series Gox(T) is in Mo, but 
is not in M§,. The modular discriminant A(r) is in M?,. See (3.4.2) 
and (3.4.3). 


Theorem 3.10. (a) For all integers k > 2, 
Ma = M8, + CGoz. 


(b) For all integers k, the map 
Mox-12 —> MQ, fr fA 


is an isomorphism of C-vector spaces. 
(c) The dimension of Mo, as a C-vector space is given by 


0 ifk <0; 
dim M2, = ¢ [k/6] ifk > 0, k = 1 (mod 6); 
[k/6+1] ifk>0,k#1(mod 6). 


(The square brackets denote greatest integer. For an alternative proof of (c) 
using the Riemann-Roch theorem, see exercises 1.8 and 1.9.) 


PrRooF. (a) By definition, MQ, is the kernel of the map 


Mo —C, ~— f+ f (ov), 
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so M2,,/MS,, has dimension at most 1. On the other hand, for k > 2, the 
Eisenstein series G2, is in M2, and is not in M$,. (See (3.4.2).) Hence 


Mox = M3,+CGox for all k > 2. 


(b) First we note that 
Ga(p) = (p + 1)*Ga(STp) = p?Ga(p) and Gé(i) = i°Go(St) = —Ge(é), 


which implies that G4(p) = 0 and Gg(i) = 0. Since G4 and Gg are modular 
forms of weight 4 and 6 respectively (3.4.2), it follows from (3.8) that they 
have no other zeros in I'(1)\H. In particular, 


A(p) = (60G4(p))” — 27(140G6(p))” = —213°5°7°Ge(p)? #0, 


so A(r) is not identically zero. 
Thus A(r) is a non-zero modular form of weight 12 with A(oco) = 0. 
It follows from (3.8) that 
ordgo(A) = 1 


and that A(r) 4 0 for all 7 € H. (For an alternative proof that A(r) 4 0 
for all 7 € H, see [AEC VI.3.6a].) Therefore 1/A has a simple pole at oo 
and no other poles, so the map 


0 
Mo, — Mok-12, 


fr f/A 


is well-defined. (The main point is that as long as f vanishes at oo, 
then f/A will still be holomorphic at oo.) This gives an inverse to the 
map in (b), so M9, ~ Mox-12. 
(c) If k < 0, then (3.8) implies immediately that Mo; = 0. (Note that all 
of the terms in the left-hand sum are non-negative.) Similarly, if f € Mo, 
then (3.8) says that f has no zeros on H*. Thus f gives a holomorphic 
non-vanishing function on X(1). But X(1) is a compact Riemann surface, 
so an analytic map [f,1] : X(1) — P1(C) is necessarily either constant or 
surjective. Hence f is constant and Mo = C. 

Next we use (3.8) to describe all functions f € Mo, for small values 
of k. Note that for small values of k, the equation 


2 3 ~ 6 


will have very few solutions in non-negative integers a,b,c. For example, 
if k = 1, there are no solutions. We compile the results in Table 1.1. 
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Table 1.1 


ord, f basis 


k ord; f ord, f mere for Moz 


if _ aoa cao 0 
2 0 1 0 G4 
3 if 0 0 Ge 
4 0 2 0 Gi 
5 1 1 0 GiGe 


Everything in Table 1.1 is clear except that the functions in the final 
column actually form a basis. They are in Mo, from (3.4.2), so we need to 
show that M2, has dimension 1. But if fi, fo € Mo, with 2< k < 5, then 
Table 1.1 shows that f; and fo have exactly the same zeros. Hence f/f2 € 
Mo = C, which proves that dim(M2,) = 1 for2<<k <5. 

We have now verified (c) for all integers k < 5. On the other hand, 
if k > 0, then using (a) and (b) we find that 


dim Mox412 = dim M341 +1 from (a) 
= dim Mo, +1 from (b). 


Thus the left-hand side of (c) increases by 1 when k is replaced by k + 6. 
Since the same is true of the right-hand side, an easy induction argument 
completes the proof. oO 


Example 3.10.1. Each of the vector spaces 
Mo, Ma, Me, Ms, Mo, Mia 


has dimension 1. For example, since G2? € Mg and Gg € Mg, it follows 
immediately that 
Gs = cG? 


for some constant c € C. Letting 7 — ioo and using (3.4.2), we can even 


compute 
_ 2¢(8) _ 3 


ame ae 


(See (7.2) for the calculation of ¢(8).) Similarly, Gio = 4G4Ge and Gi4 = 
30;GiGs. More generally, M2, has a basis consisting of functions of the 
form G{G@. (See exercise 1.10.) To appreciate the subtlety of identities 
such as these, the reader might try to give a proof that Gg = 3G} directly 


from the series definition (3.4.1) of the Go,’s. 
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Example 3.10.2. Since Mo, & Moy 419 from (3.10b), the spaces 
MYo, Mts, Ms, Mo, Mp», M36 


also have dimension 1. In particular, up to multiplication by a constant, 
there is only one cusp form of weight 12, namely A(r). 
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We begin by proving the Uniformization Theorem for elliptic curves, which 
was stated but not proved in [AEC VI.5.1]. This theorem says that ev- 
ery elliptic curve over C is parametrized by Weierstrass elliptic functions. 
Our main tool will be Theorem 3.7(a), which says in particular that every 
modular function of weight 0 defines a meromorphic function on the Rie- 
mann surface X(1). For a more elementary, but less intrinsic, proof of the 
Uniformization Theorem, see exercise 1.11. 


Definition. The modular j-invariant j(7) is the function 


g2(T)* 


j(r) = 1728 AG) 


Thus j(7) is the j-invariant of the elliptic curve 
Ey, : y? = 42° — go(7)x — gs(r), 
and E,_(C) has a parametrization using the Weierstrass g-function, 


C/A, =? Ex, (C), 
z > ((2; Ar), 9'(z; Ar). 


(For details, see [AEC VI.3.6].) 


Theorem 4.1. j(7) is a modular function of weight 0. It induces a 
(complex analytic) isomorphism 


j:X(1) PC). 


Proor. From (2.4.2) and (2.4.3), both A(r) and go(r)? = 2°335°G4(r)? 
are modular forms, and both have weight 12, so their quotient is a modular 
function of weight 0. By (3.7a) with k = 0, j defines a meromorphic 
function on X(1). (N.B. This means that j is meromorphic relative to 
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the complex structure on X(1) described by (2.5).) Hence j gives a finite 
complex-analytic map 
gj: X(1) — P*(C). 


Finally, we note that go(ico) = 120¢(4) 4 0 (3.4.2) and A(ico) = 
0 (3.4.3). Since A has weight 12, (3.8) implies that 


ord, A= 1. 


Thus 7 has a simple pole at the cusp oo € X(1) and no other poles on X(1), 
so the map j : X(1) — P!(C) is an analytic map of degree 1 between 
compact Riemann surfaces. It is therefore an isomorphism. Oo 


Corollary 4.2. Let f be a modular function of weight 0. 

(a) The function f is a rational function of j, that is, f € C(j). 

(b) If in addition f is holomorphic on H, then f is a polynomial function 
of j, that is, f € Cl]. 


PRooF. (a) From (3.7a), f defines a meromorphic function on X(1), and 
so by (4.1), f 077! is a meromorphic function on P!(C). But the only 
meromorphic functions on P!(C) are rational functions, so 


foj l(t) = P(t) for some P(T) € C(T). 


Substituting ¢ = j(z) with z € X(1) gives f(x) = P(j(2)). 

(b) From (a), we know that f = P(j) for some rational function P(T) € 
C(T). Suppose P is not a polynomial. Then there is a to € C such 
that P(to) = oo. The isomorphism j : X(1) > P!(C) from (4.1) sends H 
to C c P}(C), so we can find a 7) € H with j(79) = tp. But then f(t») = 
P(j(70)) = P(to) = 00, contradicting the assumption that f is holomorphic 
on H. Hence P(T) must be a polynomial. Oo 


Corollary 4.3. (Uniformization Theorem For Elliptic Curves over C) 
Let A,B € C satisfy 4A® +27B? #0. Then there is a unique lattice A C C 
such that 


g2(A) = 60G4(A) =-4A and — g3(A) = 140G6(A) = —4B. 


The map 
C/A — E: y=a254+Ar+B, 
z + (w(z;A), 50'(z; A) 


is a complex analytic isomorphism. 
PROOF. Using Theorem 4.1, we can choose a 7 € H such that 


4A3 


to ne ee 
I(t) = 10287 oR 
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Assume first that AB # 0. It follows from this and the definition of j(r) 


that 
27 BY 1728 aa(t)” ( B ) (22) =~ 
4A3j(T) g2(T)> ? 93(T) A 
Let 
A 
= an 5 and A=adA, = Zar + Za. 
2 
Then 5 ’ 3 
= B*ga(r 
= 4 = es be 
g2(A) =a g2(A,) — A293(r)? 4A, 
B3go(r)3 
= a6 = 2 
g3(A) =a g3(A,) ~ A3q3(T)2 4B. 


Similarly, if A = 0, then j(7) = 0 and go(r) = 0, whereas if B = 0, 
then j(7) = 1728 and g3(r) = 0. Hence in these two cases it suffices to 
take A = aA, with 


SBT: cop BORIS 
a= “4B if A = 0, and a= ary if B=0. 


This gives the existence of A. Since we will not need the uniqueness of A 
in our subsequent work, we will leave this fact to the reader. (See ex- 
ercise 1.12.) Finally, we note that the second part of Corollary 4.3 is 
essentially a restatement of [AEC VI.3.6b]. Oo 


We are now ready to relate the function j(7), defined as a meromor- 
phic function on the Riemann surface X(1), to the j-invariant defined 
in [AEC III §1] which classifies isomorphism classes of elliptic curves. We 


let 
{elliptic curves defined over C} 
ELLe = eS. 
C-isomorphism 


Thus an element of E£LL£_¢ is a C-isomorphism class of elliptic curves. We 
also recall the notation 

£ = {lattices in C} 
from §1. Much of our preceding discussion is summarized in the following 
proposition. 


Proposition 4.4. There are one-to-one correspondences between the 
following four sets, given by the indicated maps: 


CLL sees Lic s- TH) A 
{Ea} — {A} ={Ar} — TF — j(r). 
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Here A, = Zr + Z, {E,} denotes the C-isomorphism class of the elliptic 
curve Eq : y? = 4x3 — go(A)x — g3(A), and {A} is the homothety class of 
the lattice A. 


PROOF. Since j(ico) = oo, the bijectivity of T(1)\H 4, Cis (4.1). The 
bijectivity of [(1)\H > £/C* is (1.2bc). Finally, the injectivity of £/C* — 
ELL ¢ is [AEC 4.1.1] and the surjectivity is (4.3). Oo 
Let us describe in a bit more detail the bijective map 
ELLe — C 

given in Proposition 4.4. Let {EF} € E£LL¢ be an isomorphism class of 
elliptic curves, and choose a Weierstrass equation 

E:y=2°+Ar+B 
for some curve E in this class. Now take a basis 71, y2 for the homology 
group H,(E(C),Z), and compute the periods 

d. 
& and We = a 

V1 y 2 y 
(See [AEC VI §1].) Switching w; and we if necessary, we may assume that 


w= 


TE= as EH. 
we 
Then evaluate the holomorphic function j(7) at 7 = Tg. 
Thus the map 


j:ELLe —C, {E} += j(te) 
involves two transcendental (i.e., non-algebraic) operations, namely the 
computation of the periods w),w2 and the evaluation of the function j(r). 
From this perspective, it seems unlikely that rationality properties of j(7z) 


should have anything to do with rationality properties of E. To describe 
the relationship that does exist, we make the following two definitions. 


Definition. Let {EF} € ELLc, and let K C C. We say that K is a field 
of definition for {F} if there is an elliptic curve Ep in the isomorphism 
class {E} such that Ep is defined over K. We say that K is a field of 
moduli for {E} if for all automorphisms o € Aut(C/Q), 


E’ € {E} ifand only if acts trivially on K. 


Note that the field of moduli exists and is unique, since by Galois 
theory an equivalent definition is that the field of moduli is the fixed field 
of the group 

{o € Aut(C/Q) : E? € {E}}. 
From the complex analytic viewpoint described above, it is not clear that 
the number j({£}) should have any relationship to fields of definition and 
moduli for {£7}. Note that there are lots of bijections ELL — C. For 
example, j’({E}) = e"j({E}) +7” is also a bijection. But clearly, it is 
not possible for both j and 7’ to have good rationality properties. 
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Proposition 4.5. Let {FE} € ELLc. 
(a) Q(j({E})) is the field of moduli for {E}. 
(b) Q(j({£})) is the minimal field of definition for {E}. 


PROOF. The j-invariant j(E) of the elliptic curve 


E : y® = 423 — go(r)x — g3(T) 


T 3 


so for any o € Aut(C/Q), 
J(E*) = j(E)’. 
(a) From [AEC III.1.4b] we have 
E° € {E} ifandonly if j(E7) = j(E). 


Since j(E%) = j(E)°, this shows that Q(j(Z)) is the field of moduli 
for {EF}. 
(b) We know from [AEC III.1.4bc] that there exists an elliptic curve Eo 
defined over Q(i(E)) with j(Eo) = j(£), and so satisfying Eo =/c E. This 
shows that Q(j(£)) is a field of definition for {EF}. 

On the other hand, if K is any field of definition for {EZ}, let Eo/K be 
a curve in {EZ} given by an equation 


Ey: y=2°+Ar+B with A,Be K. 


Then 
4A3 é 
4A3 + 27B? 


so Q(j(E)) CK. Oo 


j(E) = j(Eo) = 1728 K, 


Remark 4.6. The reader should note that the proof of Proposition 4.5 is 
very elementary because we have explicit Weierstrass equations with which 
to work. (This is how [AEC III.1.4bc] was proven.) For modular curves 
of higher level the problem becomes considerably more difficult, since one 
cannot rely on explicit equations. (See Shimura [1, §6.7].) Finally, we 
should mention that an analogous statement is false for abelian varieties of 
higher dimension; the field of moduli for an isomorphism class of abelian 
varieties need not be a field of definition. 
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§5. Elliptic Functions Revisited 


Let A Cc C be a lattice. Our fundamental elliptic function is the Weier- 
strass o-function, 


f 1 1 
eRe 2s fee see eeysely 
(2; A) ee (ty =) 

wer 

w #0 
As we have seen [AEC VI §3], o defines a meromorphic function on the 
elliptic curve C/A. It has a pole of order 2 at 0 € C/A and no other poles. 
We have also computed the Laurent series of around z = 0 [AEC VI.3.5al, 


1 loo 
e(25 A) = + DK + NGax+2(A)2™, 
k=1 


valid for |z| less than the smallest non-zero vector in A. 

Since o(z; A) has no residues, we can integrate it to find a new function 
which will almost be periodic for the lattice A. Note, however, that when 
we integrate the series for g(z; A) term-by-term, it is necessary to adjust 
the constant of integration in each term so as to ensure convergence. 


Proposition 5.1. (a) The series 


1 1 1 
caM=5+ 0 (o+eta) 


wer 
wH40 


is absolutely and uniformly convergent on compact subsets of C \ A. It 
defines a meromorphic function on C with simple poles on A and no other 
poles. ¢(z;A) is called the Weierstrass ¢-function (associated to the lat- 
tice A). 

(b) The Laurent series for ¢ around z = 0 is 


1 CO 
C(z; A) = a S © Gorga(A)22eth, 
k=1 
PRooF. (a) Let CC C\ A be a compact set, and let 
e=inf{|z-w|:2€C,we A} and M = sup{|z|:2¢€ C}. 


Since C' is compact, we have « > 0 and M < o. 
Let z € C, and let w € A satisfy |w| > 2M. Then 


1 gia |e 1 < 2M? 
2 we pk | jee 
w 
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On the other hand, there are only finitely many terms in the sum with 0 < 
|w| < 2M, and for z € C those terms all satisfy 


1 di. 1 ie 1 é. 1 % |z| 2 1 i 1 i M 
z-w w wl} jz—-wl lw] fol? ~e lw] wl?” 

Hence 
1 1 1 2 1 1 1M 2M? 
= SS are -+—+-—> ———, 
he Z-W Tee <2+> (2 atoet & |w|3 

wer wed wer 

w#0 0<|w|<2M w>2M 


We know [AEC VI.3.1a] that the last series converges, which proves the 
series defining ¢(z; A) converges absolutely and uniformly on C. 

It follows that ¢(z;A) is holomorphic on C \ A, and an inspection of 
the series defining ¢ shows immediately that it has simple poles at each 
point of A. 

(b) Let z be a complex number such that |z| < |w| for all non-zero w € A. 
Then 


1 1 z 1 1 z 
+=+—55 he 
Z-W WwW w ws) 4 ee w 
Dal): 
w i \w 
so 
1 1 z\h 
ceundate DEE) 
wer k=2 
wH#0 
1 ; 
== - So Geri (A)z 
k=2 
This is the desired series once one notes that G;,(A) = 0 for odd k. Oo 


Differentiating the series (5.1a), we see that ¢’(z; A) = —g(z; A). Thus 
the derivative of ¢(z; A) is periodic for the lattice A, so ¢ itself will have some 
sort of “quasi-periodicity” property as explained in the following proposi- 
tion. 


Proposition 5.2. (a) For all z €C, 
56a A) = -olesA) 
que PLZ; Ah). 


(b) For allw € A and all z € C, 


C(2 +w; A) = ¢(z;A) + 0), 
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where the number 7(w) is independent of z. The map 
n:A—-C 


is called the quasi-period map associated to A. Ifw € A andw € 2A, 
then n(w) is given by the formula 


_ nw) = 2¢(4w; A). 


(c) The quasi-period map is a homomorphism of A into C. 
(d) (Legendre Relation) Let A = Zw, + Zw» be a lattice with basis satis- 
fying Im(w;/w2) > 0. Then 


W1(W2) — w2n(w,) = 27%. 


Proor. (a) The series (5.1a) defining ¢ converges absolutely and uniformly, 
so it can be differentiated term-by-term. The result is the defining series 
for —¢. 
(b) F : 

ee Aes eN\ Se ‘ eee . ’ 

Tle tus A) = -ole tus A) = oles A) = Bele) 


Integrating, we find that the quantity 


nw) = C(z +w; A) — ¢(z; A) 


is independent of z. If, further, w ¢ 2A, then ¢ does not have a pole at thw. 
1 


Putting z = —5w and using the fact (evident from the defining series) 
that ¢(—z; A) = —C(z; A), we find in this case that n(w) = 2¢($w; A). 
(c) We compute 


n(w + w’) = C(z +w +"; A) — C(z; A) 
= {C(z tw +u"A) — C(z +; A)} + {C(z +0; A) — (2; A)} 
= n(w') + nw). 


(d) We integrate ¢(z; A) around a fundamental parallelogram offset slightly 
so as not to contain points of A on its boundary. Thus let D be the region 


D= {a+ tyw, + tows :0< ti,tea< 1}, 


and let 
OD=1,4+ Ll2+13+L4 


be its boundary as illustrated in Figure 1.5. 
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at, + @, 


L; 


An Offset Fundamental Domain for C/A 
Figure 1.5 


The only pole of ¢ in D is a simple pole of residue 1 at z = 0. (Look 
at the series (5.1a) defining ¢.) Hence 


i ¢(2; A) dz = 2rt. 
aD 


On the other hand, using (b) we get some cancellation when computing 
the line integrals over opposite sides. Thus 


1 0 
if ¢(z; A) dz =| Cla+ tun: A)undt + f C(a + wy + two; A) wodt 
Ii+Ls 0 1 


1 1 
- / C(a + two; A) wodt — | (C(a + tw) + n(w1)) wedt 
0 0 


= —7(w)we. 
Similarly, 
/ C(z; A) dz = n(we)w. 
Lot+La4 
Therefore 
Qri = ¢(z; A) dz = i C(z; A) dz = n(we)w, — n(wi we. 
aD £1+L2+L3+La 


Oo 
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Remark 5.3. Let E/C be an elliptic curve given by a Weierstrass equa- 
tion, and let 
dx 


ee 
a 2y+a,x+ az 


be the associated invariant differential. The lattice A for E is the set of 


periods 
i: WE, 
_ 


where ¥ runs over all closed paths on E(C). (Equivalently, y runs through 
the cycles in H;(E(C),Z). See [AEC VI 81].) The classical name for an 
everywhere holomorphic differential such as wz on a Riemann surface such 
as E(C) is a differential of the first kind. 

Similarly, a differential of the second kind is a meromorphic differential 
with no residues (i.e., with no simple poles), and a differential of the third 
kind is a meromorphic differential with at worst simple poles. 

The differential 


p(z;A)dz=x2we 


is thus a differential of the second kind on C/A, and its indefinite integral 
is the multi-valued function —¢(z; A). The indeterminacy in ¢ is given by 
the numbers 


atw 
/ J52 i} plz; A) dz = —C(a-+w; A) + C(a; A) = —n(w), 


where w = i Wp is the period associated to the closed path +. 
In terms of our original Weierstrass equation, there is the period map 


H(EO) Dt, yes [ve 
#Y. 


whose image is the lattice A. Using this to identify A with the first ho- 
mology of £(C), we see that the quasi-period map associates to a path the 
negative of the corresponding period for the differential cw: 


7 : Hy(E(C),Z) — C, yrRo - f ewe. 
7 


The last function we want to examine is essentially the integral of ¢. 
To eliminate the indeterminacy caused by the simple poles of ¢, we take 
the exponential of the integral. This leads to a familiar function which we 
used in [AEC VI §3] to construct elliptic functions with a given divisor. 
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Proposition 5.4. (a) The infinite product 


z 2 
: = — 2) ez/wt+(1/2)(z/w) 
o(z;A) =z ll (1 =e 
wed 
wHA0 


defines a holomorphic function on C with simple zeros on A and no other 
zeros. It is called the Weierstrass o-function (associated to the lattice A). 


(b) 


2 


d d 
re log a(z; A) = C(z; A), iZ 


(c) For allz €C andweA, 


log a(z; A) = —(z; A). 


o(2 +A) = Yw)e™ E+ 4 a(z; A), 
where 7): A — C is the quasi-period map for A, and w is defined by 


p:A— {+1}, bwy=(? ifw € 2A; 


=i df £94, 


PROOF. (a) This is a restatement of [AEC VI.3.3al]. 
(b) Taking the derivative of 


oga(s:A) ='os(s)+ > {log (1-2) +2 +2 (2)'} 


weld 
wH#0 


gives the defining series (5.1a) for ¢, and then from (5.2a) we see that the 
second derivative is —g. Note that the logarithms are locally well defined 
up to the addition of a constant which disappears when we differentiate 
and also that we must take the principal branch of log (1 - =) for almost 


all w in order to ensure the convergence of the series. 
(c) From (b) and (5.2b), 


d 1 a(z+w; A) 


rea = ¢(2 +4;A) — ¢(z;A) = nv), 


sO 


o(z +w;A) = Ce")? a(z; A) 


for some constant C not depending on z. Note also that o is an odd 
function, a fact that is clear from the product defining o. 

We consider two cases. First, ifw ¢ 2A, then o does not vanish at thw. 
Hence putting z = — Sw gives 


o (w;A) = Cele g (—3u; A) = —Ce7 21) g (Su; A), 


so 
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C = —e2 le, 


Next, if w € 2A, then o has a simple zero at teu. Using L’H6pital’s 
rule yields 


mat 
-—$nww _ yj a(z +w; A) Lee (gw; A) = 
or Meat ae ere ¥) a” (—4w; A) 


(Note that o’ is an even function, since o is odd.) Hence in this case we 
find that : 
C= e2nw)y 


which completes the proof of (c). Oo 


Any elliptic function can be factored as a product of Weierstrass o- 
functions reflecting its zeros and poles. We give a general result and two 
important examples. To ease notation, since the lattice A is fixed, we will 
write o(z) and g(z) instead of o(z;A) and g(z; A). 


Proposition 5.5. Let f(z) be a non-zero elliptic function for the lat- 
tice A. Write the divisor of f as 


div(f) = >> ni(ai) 
i=1 
for some a; € C, and let 


s 
b= 5 N4Q;- 
i=l 


(See [AEC VI §2] for the definition of the divisor of an elliptic function.) 
Then there is a constant c € C* so that 


f(z) = = a a ay IT —ai)™. 


Corollary 5.6. 


(a) o(2) — (a) = - “a. 


a(2z) 
a(z)4” 


(b) p'(z) = — 
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PrRooF (of Proposition 5.5). Let 


Cae 9 ee 
9(2) 5 et ai)”. 


a(z- 
From [AEC VI.2.2c] we know that b € A, so using (5.4c), we find that 


a(z) b)(z—2 
(2) en(b)(2— 3) 
o(z—b) 


is holomorphic and non-vanishing on all of C. Since o(z) has simple zeros 
on A and no other zeros, it follows that g has exactly the same zeros and 
poles as f. Hence f(z) / g(z) is everywhere holomorphic. 

Next we verify that g is an elliptic function. Let w € A, and use (5.4c) 
to write 


for certain constants A and B which depend on w but not on z. Then 


giz+w) __o(z—0) rete) yy (sates acre ao\" 


gz) a(z—b+w) af(z o(z—a;) 


r P 
— e~ B(z—)) pBz II Cos) i 


i=1 
— eB(b-Eniai) (AeB2)"" eis 


The last equality follows from the definition of b and the fact [AEC VI.2.2b] 
that the divisor of an elliptic function has degree 0. 

This proves that g(z) is an elliptic function, and so f(z)/g(z) is an 
everywhere holomorphic elliptic function. From [AEC VI.2.1] we conclude 
that it is constant. oO 


PROOF (of Corollary 5.6). (a) Since g(z) is an even function of order 2, 
we see immediately that the zeros of 9(z) — g(a) are a and —a. Thus 


div(@(z) — e(a)) = (-a) + (a) — 2(0). 
Applying (5.5) we find 


o(z + a)a(z —a) 
a(z)? 


for some constant C. Multiplying by z? and using 


9(z) — p(a) =C 


lim 229(z)=1 and lim 2\*? 
2 
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gives the value of C, 


1 = Co(a)o(—a) = —Co(a)?. 


(b) Divide (a) by z — a and let z — a. This yields 


9 (a) = — lim ae= a) one = —0'(0)——. 


Since o’(0) = 1, this is the desired result. o 
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As we have seen in §§1-4, it is often convenient to use normalized lattices 
A, =Zr+Z with 7 € H. 
We then use the obvious notation 
(2:7), C47), o(237) for (2; Ar), (2: Ar), o(2 Ar). 


We will soon see that 9, ¢, and o are quite well behaved when considered 
as functions of two variables (z;7) € C x H. 
Note that since 1 € A,, the g function satisfies the relation 


e(z +137) = 9(z;7). 
This means that it is possible to expand g as a Fourier series in the vari- 
able u = e?'*, Similarly, since A;41 = A,, the g function satisfies 

(237 +1) = p(2;7). 


Thus, as a function of 7, the g function should have a Fourier expansion 
in terms of g = e277. 
This idea can be formulated more intrinsically as follows. Let 


u =U, = e2™ and g=qr=e ; 


and let 
q? = {q* tke Z} 


be the cyclic subgroup of C* generated by g. Then there is a complex- 
analytic isomorphism 


C/A, —> C*/q?, zee u = ett, 
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Note that this is an isomorphism of complex Lie groups, since it is clearly 
a homomorphism. 
Our first step is to express ¢(z;7) as a power series in the variables u = 

and q = e?"'7. The quickest way to do this is to write down (by 
magic?!?) the correct expression, and then verify that it gives the same 
function as g(z;7). We opt instead for a somewhat lengthier, but hopefully 
more perspicuous, derivation. 

Consider first the series 


ola M=5+ > coo ae 


wer 
wH#0 


e2tiz 


defining ¢. How does it arise? From [AEC VI.2.3] we know that any non- 
constant elliptic function must have at least two poles, so we look for a 
meromorphic function F(z) satisfying 
(i) F(z+w) = F(z) for allz EC, we A; 
(ii) F(z) has a double pole at each point in A and no other poles. 
The simplest function with a double pole at w is (z —w)~?. By aver- 
aging over w € A, we find a series 


F(z) = ear: 


wed 


which formally satisfies (i) and (ii). The problem is that this series is not 

absolutely convergent. However, by subtracting an appropriate constant 

from each term, we can create a series which does converge and has the 

desired properties. This is how we “discovered” g(z; A) in [AEC VI §3]. 
We apply the same principle to express o(z;7) as a function of u and gq. 

Exponentiating the conditions (i) and (ii), we look for a function F(u; q) 

satisfying 

(iii) F(q*u;q) = F(u;q) for all u € C*, k € Z; 

(iv) F(u;q) has a double pole at each u € q” and no other poles. 


As above, we look for F' to be an average 


F(u;q) = >> f(q"u) 


neZ 
for some elementary function f. Such an F' will clearly satisfy the period- 
icity condition (iii). 
To obtain (iv), we need f(Z) to have a double pole at T = 1. For 
example, we might use f(T) = (1—T)~?. But the series 


1 
rer 


neZ 
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does not converge, since |g] < 1. The terms with n — —oo are all right, 
since then q” — oo. But as n > oo, the n™ term goes to 1. 

In order to get convergence, we want f(T) to have a double pole at T = 
1 and also to satisfy 


jim f(T) =0 and jim f(T) =0. 


The simplest such function is f(T) = T(1—T)~?, which leads us to consider 
the function F’(u;q) in the following lemma. | 


Lemma 6.1. Let 
qu 
Fag =) a 
os Ug) 

(a) The series defining F’, considered as a function of z, converges abso- 
lutely and uniformly on compact subsets of C \ A;. 
(b) F is an elliptic function for the lattice A,. It has a double pole at 
each z € A, and no other poles. 
(c) The Laurent series for F around z = 0 begins 


1 1 q” 
fui = ee eo Ye ; 
(u; g) (nie 1D dX a-¢,p + (powers of z) 
ProorF. (a) Note that 
q’u grunt 


(= gue Get 


We use this identity to rewrite the terms in F having n < 0. This gives an 
alternative expression for F, 


Re gee q’u grunt 
Pid) = Gop + Dg t pee} 


n>1 


Now let C C C\ A, be a compact set. Then u = e?""* is bounded away 
from 0 and oo uniformly for z € C. Since gq” — 0asn — &, it follows that 
there are constants c, and c2 so that 

gnu -1 
(1 — g™u)? 


q’u 
(Lqru ty? 


< cilg”| for allz EC, n> co. 


This shows that except possibly for the terms with n < co, the series is 
absolutely and uniformly convergent on C’. 
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Consider now one of the finitely many terms with n < co. We know 
that u 4 q” for z € C, so the compactness of C’ ensures that 


inf {1 — q”u="| > 0. 
inf | qhu~"| 


Hence the terms with n < cz are also uniformly bounded. 

(b) From (a), F’ is a holomorphic function on C \ A,, and looking at the 
series defining F’, it is clear that F has a double pole at each point in A,. 
Finally, since the transformations z +> z+ 1 and z+ z+7 correspond 
tour u and ue qu respectively, it is clear again from the series that F 
is an elliptic function for the lattice A,. 

(c) Note that wu = e?"* > 1 as z — 0. Hence the pole at z = 0 in the 
series for F’' comes from the term with n = 0. Now a little freshman calculus 
yields 


Z alli : Eck (aiyteersatih ) 
= = . wer Zz). 
(—u)? —e2mey?  (Qnizy? 127 SP 


Hence using the alternative series for F' given above, we find 


tim, { Flwia = ony + a} =e {Fug 7 ard 


Theorem 6.2. Let u = e?""* and q=e 


nez >1 
= qu gqrua} 
no (1 = qru)? 2 el = qru-1)? 
1 gq” 
PGR DY 5 
12 0) 
1 qru(i + q”u) 
b "(z; 
1D) aya? i) X G— qu 
-S> qvu(l + q”u) - qrua(i-+q"u*) 
a, Aba ee gra e 
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Remark 6.2.1. For |X| < 1 there is the elementary identity 


ee 2) Se 


mo>1 


This is sometimes used to rewrite the final sum in (6.2a) as 
mq™ 
n>1 n>lm>1 m>1 


PrRooF (of Theorem 6.2). (a) Let F(u;q) be as in (6.1). Consider the 
function 


aepam(ei7) - (ua) +35 2 oP 


From (6.1b) we see that this expression is an elliptic function for the lat- 
tice A, which is holomorphic on C \ A,. Further, comparing the Laurent 
series for F’ given in (6.1c) with the known Laurent series for g, we see that 
it is also holomorphic at z = 0, and in fact it vanishes there. It thus repre- 
sents an everywhere holomorphic elliptic function which vanishes at z = 0. 
Applying [AEC VI.2.1], we conclude that it is identically zero. This proves 
the first equality in (a), and the second is an easy rearrangement of the 
terms in the initial sum. (See the proof of (6.1a).) 
(b) Apply - = miu to (a). oO 
The next step is to find a qg-expansion for ¢(z;7) analogous to (6.2). 
By construction, 


d 
5687) = — (27), 


so we try integrating the series (6.2a) for o term-by-term. Proceeding 
blindly, we find 


[ee a=] q’u du = 1 1 
(l-—@q aye? (l—q®u)? 2niu  2ni (1 — ghu) 


Unfortunately, the series 
1 


neZ 


is clearly divergent, the n*® term goes to 1 as n — oo. But just as in 
the original definition of 9, we can improve the convergence by adding a 
constant onto each term. 

The second expression for g in (6.2a) has the form 


1 é qru 
(2zi)? ae De As nae p> (1 —gru-ty2 7 on 


n>1 


where 
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roa a Lae 
(We will soon identify C, more precisely.) Now integrate: 
56lei7) = pa] ee dz= Lees 
~- | ae +L ae ewip tO 


n>0 


Tie fo SS 
= ——+—— — 27iC, z+ C2 
= pote 1 
n>0 1 g° o n>1 1 i ue 
—g"u U 1 
=~ + pt 5 — 2ri0y2 + Cr 
n>0 cree a a 


for some constant of integration C2 = C2(q). Note that the last series is 
absolutely and uniformly convergent on compact subsets of C \ A;, so it 
defines a meromorphic function on C. (The proof is identical to the proof 
of (6.1a).) Further, the (d/dz)-derivative of this series can be computed 
term-by-term and agrees with the series (6.2a) for g. This proves that it 
equals ¢ for some choice of C2. 

To find C2, we compute the first few terms of the Laurent series 
around z = 0. We already know (5.1b) that 


¢(z;7) = : — G,(r)z? + higher powers of z. 


On the other hand, the pole at z = 0 (i.e., at u = 1) in the above q-series 
comes from the n = 0 term, so we find 


—u —qru qruat : 
————— > -2niC’ C. 
+d (+a TIC Zz + C2 
n>1 
te” 
vanishes at z = 0 (u = 1) 


e27z 


= — [ogame + C2 + (powers of z) 


1 1 
TGs tat C2 + (powers of z). 


Since the Laurent series for ¢ has no constant term, we see that C2 = —3. 
This proves part of the following theorem. 


Theorem 6.3. Let ¢(z;7) be the Weierstrass ¢-function and n: A; — C 
the quasi-period homomorphism associated to A. 


(at) Clair) = rao rere So + epal)z ~ 5 


207% 


a0! 
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1 
(b) aaae") = | _ 3) +4) aap 


Rael 


PrRooF. Let 
—q'u gru-* 
G(z;T) = ——— _ and 
(237) . oe a fe 


n>1 


We proved above that 
1 d 

ng 047) = G(z;T) — 2miC1(q) — = 
Now evaluate at z = 3. From (5.2b), ¢($;7) = n(1). Further, z = $ 


corresponds to u = e”’ = —1, so all of the terms in G(5;7) cancel except 
the n = 0 term. Thus 


1 1 1 1 , 1 ¢ 
——7(1) = ami? (5:7) =G (5:7) — 2niC1(q) - 5 —27iC1(q), 
so , 
Ci(q) = - > 701). 
This completes the proof of both parts (a) and (b). oO 


Finally, we integrate the series for ¢(z;7) and exponentiate to obtain 
an important g-product expansion for o(z;T). 


Theorem 6.4. The Weierstrass o-function has the product expansion 


1 oi Be (1 — q™u)(1 — g™u7!) 
— e312 p—wiz(y _ oT Ae Oe ey. 
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ere =e 


where u = e?"** and q = e?™*7 as usual, and n(1) is the quasi-period 


associated to the period 1 € A,. 


PROOF. By construction, 


om = - log o(z;T) = C(z;7). 
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Using (6.3a) and integrating gives 


& ie gut du as 
-~/ 5 +> /Sa - + fade mi) dz 


n>0 n>1 
= Sy log(1 — g”u) + > log(1 — q”u7!) + $n(1)2? — miz+ C3. 
n>0 n>1 


We claim that the series will converge provided we use the principal branch 
of the logarithm when evaluating log(1 — qu) and log(1 — q"u7'). To see 
this, note that for n sufficiently large we have |g”ut!| < 5. So, for all but 
finitely many n, 


ah) q” u*?) 


<2 \qru="| , 


Cale 


Jlog(1 — q”u 


Hence the series will converge. 
Exponentiating, we eliminate any ambiguity arising from the choice of 
a branch of the logarithm and obtain the product representation 


o(z;T) 43 e2n(l)z?—nizt+Cs I[a = q’u) IIa ons q’u_'). 


n>0 n>1 


It remains to find C3. Recall that o was normalized by the condition 
that a(z;7)/z —lasz-—0. It is the n = 0 term in the product which 
vanishes at z = 0, so we find 


1 = lim aao) 
20 Zi 


1 : 1- 
Sm eae ees () ][G-@wd -au") 


z—0 
ual n21 


= e(—2ni) [[(-4")? 


n>1 


Hence 
C3 _ 


1 1 
sa ll — qnr)2’ 
2ni os, (A q”) 


which gives the desired product formula for o(z;7). Oo 
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§7. g-Expansions of Modular Functions 


The Eisenstein series Go,(7) is a modular function of weight 2k. It sat- 
isfies Gox(T +1) = Gox(7), so it has a Fourier expansion in terms of the 
variable g = e2"*7. In this section we will compute the Fourier series of Gox 
and use it to deduce various properties of the Fourier expansions for A(T) 
and j(r). 


Proposition 7.1. Let k > 2. Then 
a 
Gox(T) = 2¢(2h) + 2 = 2 ie i(n)q” 


where 


¢(s) = as and o.(n) = yoa* 


n>1 d\n 


are respectively the Riemann ¢-function and the k*>-power divisor func- 
tion. 


PROOF. 
Gox(T) 


1 
DS. eae 
Rael (mr +n) 
(m,n)A(0,0) 


2 a 2 Dae ae 


nez m=1neZ 
n£~0 


The first sum is just 2¢(2k). Notice that the rightmost (inner) sum is 
clearly invariant under 7 +> 7 + 1. We now compute its Fourier expansion. 


Lemma 7.1.1. Let k >1 be an integer. Then for allt € H, 


1 oa = 2k-1 e2tirr 
Pe (rF+n)* (2k = 1)! 4 2 


neZ 


PROOF. Ignoring questions of convergence, we have a formal identity 
1 W. . sae® 
> pare a log(7 + n) 
2k ery: i qk, O8 
mr (r +n) faa (2k a. d 
1 
= GER aaa bare): 


Of course, this product does not converge. But we do get convergence if we 
factor an n out of each term. (Remember this is just a formal manipulation 
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to see what the answer should be. Otherwise one might rightly object that 
dividing by (co!)? is a highly dubious procedure.) The product 


2 
= 
r 1 “) = S eee 
MI (1+2)="T1 (1-3) 
neZ n=1 
n#0 
converges to give a function that is holomorphic on C, has simple zeros 
at each integer, and no other zeros. With this description, the reader will 
undoubtedly recognize the usual product expansion for the sine function. 


(See Ahlfors [1].) 
2 
sin (nr) = a7 J] G-3). 


n=1 
We now reverse our formal argument to produce a rigorous proof. 
Starting with the product expansion of the sine function, we take the log- 
arithmic derivative, yielding 


d 1 <geee, OF 
pr =a ee 
i en 1 
=14 (= —) 


Now taking (2k — 1) more derivatives, we find 


2k 


d tees 
“pe, log(sinnr) = —(2k — 1)! {= et ~ Were Lae Ge re) 
aes) a) 


Next we compute the Fourier series of (a branch of) log(sin 7). Writ- 
ing 
. 1 iT —7t —7t 200T 
sin(rr) = - (e™"" —e-™"7) = —Se"™ (1-€77"7), 
we find (for 7 € H) 


log(sin rr) = —log(—2i) — mir + log (1 — e2nir) 


— log(—2i) — mir — oe se 


r=1 
Differentiating 2k times (with k > 1) yields 
ate log(sin a7) = Sami) By 2h ger 
pe, 8 ~ : 


r=1 
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Equating this expression for d?* log(sintr)/dr?* with the expression ob- 
tained above gives the desired result. 
O 


We resume the proof of Proposition 7.1. Applying (7.1.1) with m7 in 
place of 7, we find 


Gok (T ) = 2¢(28) 42> oie 


m=1 neEZ 


= 2¢(2k) + 23 — = sh. J r2k-learirme 
(2k — 1)! 


1 ix lr=1 


= 2¢(2k) + RD oa ; 3 So tk Le2mine, _ 


n=1 rin 


As is well known, ¢(2k) is a rational multiple of 1?*. It is frequently 
convenient to factor a 1?* out of the Fourier series for Go,, yielding a series 
with rational coefficients. We briefly recall the details concerning special 
values of the Riemann ¢-function at even integers. 


Definition. The Bernoulli numbers B, are defined by the power series 
expansion 


oe k 
£ £ 
= B 
er — 1 Ss . 
k=0 
For example, one easily computes 
1 1 1 
Bo = 1, By =—-, Byg=-, Ba= Dr 
c a. od 26 oS a0 
1 1 
Be 42° Bg = ~ 30° and Bor+1 = 0 for allk > 1. 


For a longer table of B;,’s and the corresponding values of ¢(2k), see (A §1). 
Proposition 7.2. For all integers k > 1, 


eof Qni)2* 
62k) = D> = ee Big 


n=1 


PROOF. First we use the definition of the B,’s to write 


emt + e7me 2 
rx cot(ra) = nix ——_____ = nix { 1 + ———— 
emt _ enix e2tix = 


i) - \k : 
pe ‘ Br (27iz) = y Bon (2ria 
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Next we use the product expansion for sin(7z) already considered in the 


proof of (7.1), 
i = 1—--—}. 
sin(rz) = 7x II ( =) 


n=1 


Taking the logarithmic derivative yields 


22 1 


1 
mr cot(7ax) = ae + ae femaee 


8 iMe 


8 le 


Comparing the two Laurent series for 7x cot(mz) gives the desired result. 
Oo 


Remark 7.3.1. We can now define a normalized Eisenstein series E,(T) 
as the series 


Eox(T) = = I-s a ee i(n)q” 


RD 


Using (7.1) and (7.2) we see that 
Gox(T) = 2¢(2k) E4,(rT). 


The fact that the £2,’s have leading coefficient 1 makes them particularly 
easy to compare. For example, E? and Eg are both modular forms of 
weight 8. Since Mg has dimension 1 from (3.10.1), we know that they are 
multiples of one another. But since they are normalized, we see on com- 
paring their constant terms that E? = Eg. Equating Fourier coefficients 
gives the identity 


n—-1l 


o7(n) = o3(n) + 120 mE a3(m)o3(n — m). 


m=1 
The reader will be able to construct many more identities of this sort. 


Remark 7.3.2. We can also write go(7) and g3(T) in terms of normalized 
Hisenstein series: 


g2(T) = 60G4(r) = 120¢(4)E4(r) = (2n)¢ E4(7); 


273 


lI 


1 
gags ol). 


These expressions are useful for computing the Fourier expansion of A(T) 
and j(rT), as explained in the next proposition. 


g3(T) = 140G6(r) = 280¢(6)E6(r) = (2m)° 
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Proposition 7.4. (a) The modular discriminant has the Fourier expan- 
sion 
A(r) = (2)? $0 r(n)q”, 
n>1 


where T(1) = 1 and r(n) € Z for all n. The arithmetic function n +> T(n) 
is called the Ramanujan r-function. 
(b) The modular j-function has the Fourier expansion 


1 
Hr) =— + Di e(n)q”, 
q n>0 
where c(n) € Z for all n. 


PROOF. (a) Using (7.3.2) we compute 


A(r) = ga(r)* ~ 2709(7)2 = GDS (104(x) — Bon). 


We must show that every coefficient of E} — E? is divisible by 2°33 = 12°. 
From (7.3.1) we have 


Ex(r) =14+240$ o3(n)q" and =—Eg(r) = 1-504} a5(n)q”. 
n>1 n>1 
To ease notation, let us write 
Ex(r)=1+240A and = E£¢(r) =1—504B. 
Then 
E4(r) — Ee(r)? = (1 + 240A) — (1 — 504B)? 
= 127(54 + 7B) + 123(100A? — 147B? + 8000). 


It remains to show that every coefficient of 5A +-7B is divisible by 12. 
We have 


5A+7B = )-(503(n) + 705(n))q” = >. S_(5d* + 74°) q”, 


n>1 n2>1 d\n 
and for any integer d, 


d3(1 — d”) = 0(mod 4), 


bd? + 7d? = (5+ 7d’)=4 : 
d°(—1 + d*) = 0(mod 3). 
Hence 5d? + 7d®> = 0(mod 12). This proves that 


A(r) = (2m)? S2 r(n)q” 


n>1 
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for integers 7(n). 
Finally, the coefficient of q is 


(27)}? 
128 


- 12? - (503(1) + 705(1)) = (2m)??, 


so T(1) = 1. 
(b) We use (a), (7.3.1), (7.3.2), and the definition of j(T) to compute 
(2m)? 3 
—— E, 
g2(7)° 128 
j(7) = 1728 = 172312 _ 
( A(t) Om)? rime" 


n>1 


(1 +240 5° ca(nla") 


n>1 


a+ >_r(n)q” 


n>2 


Since the o3(n)’s and the 7(n)’s are all integers, this last expression gives a 
Laurent series of the form g~! + >> c(n)q” with integer coefficients. (Note 
the reciprocal of a power series with integer coefficient and leading term 1 
will again have integer coefficients. ) oO 


Remark 7.4.1. Using the formulas developed in the proof of (7.4), it is 
easy to compute the first few values of r(n) and c(n). Thus 


(2m) !2A(r) = q — 24g? + 25293 — 1472q4 + 48309 +---, 


j(r) =q7! + 744 + 196884q + 214937609? +---. 


For a more extensive list, see (A §2). 


Remark 7.4.2. In the next section we will prove that A(r) has the prod- 
uct expansion 
A(r) = (2m)¥q [[ (1 -@")™. 


n>1 
This gives an alternative (but less elementary) proof of (7.4a) 


Remark 7.4.3. The c(n) coefficients of j(7) have many interesting arith- 
metical properties. For example, Lehner [1,2] proved that they satisfy the 
following divisibility conditions. (See also Apostol (1, Ch. 4].) 


n =0 (mod 2°) ae c(n) = 0 (mod 2%¢+8), 
n =0 (mod 3°) = c(n) = 0 (mod 37¢+3), 
n =0 (mod 5°) => c(n) = 0 (mod 5°+1), 
n =0 (mod 7°) = c(n) = 0(mod 7°), 

n =0 (mod 11°) = c(n) = 0(mod 11°). 
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Remark 7.4.4. The values in (7.4.1) and (A §2) suggest that the c(n)’s 
grow quite rapidly. This is indeed the case, as is clear from the following 
asymptotic formula proven by Petersson [1] using the circle method of 
Hardy, Ramanujan, and Littlewood: 


eitvn 
c(n) ~ asn— oO. 


V2n3/4 


It also turns out that the c(n)’s are intimately connected with represen- 
tations of the largest sporadic groups, in particular with the Fischer-Griess 
monster group. See Conway [1] and Conway-Norton [1] for an interesting 
account of this surprising connection. 


Remark 7.4.5. Ramanujan’s 7-function also has many interesting prop- 
erties. For example, we will later prove (10.7) that it satisfies the identities 


T(mn) = T(m)r(n) if (m,n) = 1, 

T(p°t!) = T(p)r(p*) — p''r(p*"') for p prime and e > 1. 
These identities were conjectured by Ramanujan; the first proof was given 
by Mordell. 

We will also prove (11.2) that the 7(n)’s grow much more slowly 
than the c(n)’s. Precisely, we will show that there is a constant c such 
that |7(n)| < en® for all n > 1. Another conjecture of Ramanujan, proven 


by Deligne as a consequence of his proof of the Riemann hypothesis for 
varieties over finite fields, says that one can do better. 


Theorem 7.5. (Deligne [1,2]) 
|r(n)| < oo(n)ni}/? for alln > 1. 


(Here oo(n) is the number of divisors of n. For example, if n is prime, 
then oo(n) = 2.) 


In the other direction, there is the following open conjecture of Lehmer. 


Conjecture 7.6. (Lehmer [1]) 


t(n) #0 for alln > 1. 
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88. Jacobi’s Product Formula for A(7) 


In this section we will prove Jacobi’s beautiful product expansion for the 
modular discriminant A(r). 


Theorem 8.1. (Jacobi) 
A(r) = (2n)"q [[ GQ — 4"). 


n>1 


Remark 8.2. We will derive the product (8.1) directly from the definition 
of A(r) and the product representation (6.4) for the Weierstrass o-function. 
There are other methods which can be used to prove (8.1). For example, see 
Serre [3, Ch. 7, Thm. 6] for a proof based on rearrangement of conditionally 
convergent double series, and Apostol [1, Ch. 3, §2] or Siegel [1] for an 
exposition of Siegel’s clever proof using residue calculations. The heart of 
both of these proofs lies in first proving that the function 


F(r) =q][(-4")* 


n>1 
satisfies 
F(-1/r) = TF (r). 


Since F visibly satisfies 
F(r +1) =Fi(r) and lim F(r) =0, 
TUCO 


and since S and T generate the modular group I'(1), it follows that F is 
a cusp form of weight 12. Hence F(r)/A(r) is a holomorphic modular 
function of weight 0, so it is constant. Finally, letting 7 — ioo, one easily 
checks that this constant is (27)~1!?. 


ProoF (of Theorem 8.1). By definition, 
A(r) = go(r)° — 27g3(r)* 
is the discriminant of the cubic polynomial 
4X3 — go(r)X — g3(r) = 4(x — e1)(x — €2)(x — €3). 


But we know the roots of this polynomial from [AEC, VI.3.6], namely 


1 T T+1 
el =~ a7 ) e2=/ 97 p) €3 >= 2 9 TY: 
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Thus 
A(r) = 16(e; — e2)?(e, — €3)?(e2 — €3)?. 


The idea of our proof is to express A(r) in terms of special values of the 
Weierstrass o-function and then use the product expansion (6.4) for co. 
If we differentiate the equation 


p'” = 4(— — e1)(~ — €2)(@ — es) 
and divide by 2’, we find 


ut 


9 = 2(~9 — €1)(9 — €2) + 2(p — €1)(p — €3) + 2( — €2) (9 — €3). 


. ; iL T T+1 : 
Now if we evaluate successively at z= =, z= 3 and z= or we see in 


each case that only one of the three terms survives: 
” 1 =2 
ee ae (ea 63) (C13), 
nf 7 -9 
9 ar = 2(€2 — €1)(€2 — €3), 
T+1 
gy” ( 5 . = 2(e3 — €1)(e3 — €2). 


Comparing these formulas with the expression for A(r), we write A(r) in 
terms of values of 0”, 


1 Ft T+1 
A(T) = —29" (5-7) p" (5.7) ou ( 5 7) 


Recall (5.6b) that we have expressed ’ in terms of the Weierstrass o- 
function, 


; _ _a(2z,7) 
i ee a(z,T)4° 
Taking derivatives gives 
whe 97 (22,7) a(2z,T)o'(z,T) 
a(z,7)4 a(z,7)° 
Wy 17rtrt+il . 
If we evaluate go” successively at z = 3°95" , the second term will 


vanish, since o(z,7) has zeros at points in the lattice Zr + Z. We obtain 


go” ($7) = 97, forw =1,7,7+1. 
o (3,7) 
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Combining this with the above formula expressing A in terms of 9”, we 
find 
o'(1,T)o'(7,7)0'(7 + 1,7) 

1 41 4° 
[> (3.7) o (3,7) o (,7)] 

In order to compute the values of the derivatives in this expression, we 
take the transformation formula (5.4c) for o(z,7) and differentiate it with 
respect to z. This gives 


A(r) = 16 


a! (z +0,7) = P(w)n(w)e™ E+ 2) (2) + b(w)e™) 242) g'(z) 
for all w € Zr + Z. 
Now put z = O and use the fact that o(0) = 0 and o’(0) = 1 to get 


a'(w) = p(w)er™)/2, Taking w = 1, w = 7, and w = r +1 in succession 
yields 


o'(1) = —e(D/2, o'(r) = —e7™)/?, and of (7 $1) = —eM Dr +9)/2, 


Next we use Legendre’s relation (5.2d), which in our situation reads Tn(1) — 
n(T) = 2ni, to eliminate (7). After some algebra we obtain 


o/(1) =—e2",  o!(r) = —e?"""'q7?, and o(r +1) = —e2"7t)”, 
where to ease notation we write 7 = 7(1). 


The next step is to use the product expansion (6.4) for o to compute 
o at the half periods. Thus 


8 
4 
1 1 1 1+q” 
o(-,7T) = —e2". 94 ——}, 
€ ) (2Q7i)4 I 1— gq” 
4 1\4 
T 4 1 1,72 _ 1 (1 —g"t2) (1 ~q"~2) 
& = — e217 qg7!(1 — g2 Ao 
o (5:7) Qnty 4 (—4) TI (1 —q")8 


n>1 
8 


1 t3 1-—q"2 
= ——e2 q eons ; 
(27i)4 Ul 1—q" 


4 
T+1 1 a, De acs 1 
o (Spar) = geet at) 


occa Ca 


oor Gog)? 
8 
Jat ebn(rth)? gl peak’ Ne 
(27i)4 1—q" 
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_ qnry3 
= (27)! I = 4") 


— qen-1 
cope ee) 


2! Gag ae) 
= (2r)'%q Ul See 
since (TJ 1 — @?"~')({] 1-4") =]]1—4" 


= (21)}2q I[c —q”")** — since 1 — q?” = (1+ q”)(1—q”). 
n>1 
O 


In view of the exponent appearing in the product expansion for A(r), 
it is natural to study the function obtained by taking 24*-roots. 


Definition. The Dedekind n-function (rT) is defined by the product 


n(r) = e27*7/24 [[a —q") for r © H, q = e27"". 
n>1 


Warning. Do not confuse the Dedekind 7-function with the quasi-period 
map (5.2b) 7: A —> C. This may be especially confusing when A = A,, 
since then the symbol n(r) has two meanings, and it is quite possible for 
both to appear in a single formula. For example, 


(27)'* (Dedekind n(r))™* = A(r) = product of values of o(r), 


and using the product expansion (6.4) for o will give a formula involving the 
quasi-period 7(1). Why, you may ask, do we continue to use this confusing 
notation? Tradition! 


Proposition 8.3. (a) The Dedekind n-function satisfies the identities 
: 1 
n(r +1) = &27*/*4n(7), and (-2) = V-irn(r). 


Here we take the branch of ,/_ which is positive on the positive real axis. 


(b) 
A(r) = (20) !?9(7)*. 


ProorF. Note first that (b) is immediate from the definition of n(7) and Ja- 
cobi’s product formula (8.1) for A(r). Next, since the transformation T > 
7 + 1 does not change q, we see from the definition of (7) that 


n(r ae 1) = e27(7+1)/24 [Ja - q”) = 27/24 (7), 
n>1 
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Finally, we know that A(7) is a modular form of weight 12, so 
A ee = TA(r) 
: . 
Using (b) and taking 24**-roots shows that 
1 : 
ala} ey—er ar) 


for some 24*-root of unity «. Now evaluate at tT = 7. Since —1/i = i, we 
find that ¢ = 1. Oo 


Remark 8.4. More generally, let 


b : 
v= (2 4) €Sta@) with c > 0. 


Taking the 24*-root of (8.3b) and using the known transformation property 
of A(r) shows that 


nyt) = BRO ier + dn(r) 


for some integer ®() depending on y. For example, (8.3a) says that ®(S) = 
0 and ®(T) = 1. Note that although ®(7) is only defined modulo 24, we 
can pin down a particular value for ®(y) by fixing a branch of log n(r), 
setting 


oni 
8) = log n(yr) — log n(r) + § log{—i(er + d)} ifc >0 
and requiring ®(—) = ®(7) if c < 0. 

For many purposes it is important to know precisely how 7) transforms. 
The following theorem of Dedekind supplies the answer. First we need one 
definition. 


Definition. Let x and y be relatively prime integers with y > 0. The 
Dedekind sum s(x, y) is defined to be 


oon $4 (2-[5]-) 


j=l 


(The square brackets denote the greatest integer function.) 
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Theorem 8.5. (Dedekind) Let y = é Hy) € SL2(Z) with c > 0. The 
Dedekind 7-function satisfies the transformation formula 


m(y7) = PPO /—i(er + d)n(r), 


where \/_ is the branch of the square root which is positive on the positive 
real axis, ®(y) is given by the formula 


1 od 
O(y) =~ + = — Ls(d,c), 


and s(x,y) is the Dedekind sum defined above. 


PROOF. Since we will not need this result, we omit the lengthy proof. The 
interested reader might consult Apostol [1, Thm. 3.4] or Lang [2, Ch. IX]. 
O 


Remark 8.6. Dedekind sums s(z, y) satisfy many interesting relations. 
Of particular importance is Dedekind’s reciprocity law: Let x,y > 0 be 
integers with gced(x, y) = 1. Then 


zy 1 
BO) eA) ee ae 

See Apostol ({1, Thm. 3.7] or exercise 1.17. A good source for information 
about Dedekind sums is Grosswald-Rademacher [1]. 
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Let E'/C be an elliptic curve. We have seen amply demonstrated in [AEC] 
the importance of studying isogenies connecting our given elliptic curve E 
with other elliptic curves. If E(C) & C/A for some lattice A € £, then an 
isogeny E’ — E of degree n corresponds to a sublattice A’ C A of index n 
by the natural map 


C/A’ — C/A, Ze Zz. 


In keeping with our general philosophy in this chapter, rather than 
focusing on a single isogeny, we instead consider the set of all isogenies 
to E of degree n. Equivalently, we look at all sublattices of A of index n. 
This is the same as studying degree n maps from E to other elliptic curves, 
since we can always take the dual isogeny. In our situation, the dual isogeny 
C/A > C/A’ is induced by the map z+ nz. This leads to the notion of a 
Hecke operator. 
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Definition. For any set S, let Div(S'}) denote the divisor group of the set S, 
that is, the free abelian group generated by the elements of S, 


Div(S) = B Z-s. 
ses 


A homomorphism T : Div(S) — Div(S) is called a correspondence on S. 
Notice that a correspondence is determined by linearity once its values are 
known on the elements of S. 


Definition. Let n > 1 be an integer. The n‘** Hecke operator T(n) is the 
correspondence on the set of lattices £ whose value at a lattice A € & is 


Tin)A= S° (A). 
A'CA 
[A:AJ=n 
If two lattices are homothetic, then they give the same elliptic curve. 
This suggests that we should also look at the following homothety operator. 


Definition. Let A € C*. The homothety operator Ry is the correspondence 
on & whose value at a lattice A € L£ is 


RyA=dAA. 


Since the T(n)’s and the R)’s are homomorphisms which map the 
group Div(£) to itself, they can be composed with one another. The fol- 
lowing fundamental calculation describes the algebra that they generate. 


Theorem 9.1. 

(a) RyRy = Ry for all A, uw € C*. 

(b) RyT(n) = T(n)Ry for allX € C*,n > 1. 

(c) T(mn) = T(m)T(n) for allm,n > 1 with gcd(m,n) = 1. 
(d) T(p*)T(p) = T(p**") + pT (p"")Rp _ for p prime, e > 1. 


PROOF. (a) 
RR, (A) = Ra(uA) = AwA = Ry, (A). 


(b) This follows immediately from the definitions and from the fact that A’ 
is a sublattice of A of index n if and only if AA’ is a sublattice of AA of 
index n. 

(c) Let A” ie A, where the superscript mn denotes the index. Since m 
and n are relatively prime, the quotient A/A” has a unique decomposition 


A/A”" = ®y x ®y, with |®,,| = m and |®,| =n. 
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It follows that there is a unique intermediate lattice A’ satisfying 
AvENEA 
namely 
A’ ={xeEA:mae A"}. 
Using this fact, it is now easy to verify (c). 


T(mn)A = 2 (A") = S> Ss (A") 


NITOA NCA ANCA 
= S> T(m)(A/) = T(m)( - (A’)) = T(m)T(n)A. 


NCA NCA 
e+1 
(d) Let A € £. For a given sublattice A’ é A, let a(A’‘) and b(A’) be the 


integers defined by 


— aes oar nfl if A’ Cpa, 
a(A')=4#{T: A’ CT CA} and wa) = {4 if A’ ¢ DA. 


Then 
TP)TRA=S> So (A)Y= SO aA), 
rea wer Meek 
Tip )A= D7 (A’), 
Wea 
Te )RA= S> (AY) = SO d(A)(A). 
ie pi we OR 


(Note that pA has index p? in A.) The identity (d) we are trying to prove 
is thus reduced to verifying 


+ 


e+1 
a(A’)=1+pd(A’) forall ATC A. 


We consider two cases. 
Case 1. A’CpA, 0(A’) = 1] 


Let © C A. Then T > pA > A’, so a(A’) is increased by one for each 
such I. Hence 


a(A’) = #{P:P CA} =p+1=1+4 pd(A). 
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[Quick proof of the middle equality: The set of T c A corresponds to 
subgroups of A/pA & (Z/pZ)? of index p (or equivalently of order p). 
These are the lines in A?(Z/pZ), so there are #P!(Z/pZ) = p+1 of them. 
For a proof of a more general result, see (9.3) below.| 


Case 2. A’ ¢ pA, 0d(A’) =0 


Let I satisfy A’ CP CA. Note that pA CT. We have inclusions 


A’ T p A 
0 Cc : 
= ag ope ok 
T 
Not equal, Index p, 
since A’ ¢ pA. since PC A. 


But A/pA has order p?, so we conclude that the middle inclusion must be 
an equality. Therefore 
T= A'+paA. 


Thus for a given A’ ¢ pA there is exactly one I satisfying A’ c T CA. 
Hence 
a(A’) = 1=1+4 pdb(A’). 
O 


Corollary 9.1.1. Every T(n) is a polynomial in the T(p)’s and R,’s for 
primes p. More precisely, the rings 


Z[T(n), Rn: n€Zn>I1] and = Z|T(p), Rp : p prime] 


are the same. This ring is called the Hecke algebra (of T(1)). (Notice that 
the Hecke algebra is a subring of the ring of correspondences 


End(Div(£)) = {homomorphisms Div(£) > Div(L)}. ) 


PRoor. Factor n = pj! ---p&". From (9.1a) and (9.1c) we find 
Rrn=|[ RR and = T(n) = [] T(v%'). 
i=l i=1 
Finally, (9.1d) and an easy induction on e shows that T(p*) is a polynomial 
in T(p) and Rp. Oo 


Corollary 9.1.2. The Hecke algebra Z[T(n), Rn : n € Z,n > 1] is com- 
mutative. In particular, 


T(m)T(n) = T(n)T(m) for all m,n > 1. 
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(Note that End(Div(£)) is definitely not commutative.) 


PRoorF. From (9.1a,b,c), we are reduced to showing that T(p°) commutes 
with T(pf). This follows from (9.1.1), since both T(p*) and T(p/) are 
polynomials in T(p) and R,, which commute from (9.1b). Oo 


Example 9.2. Using (9.1d), it is easy to illustrate (9.1.1) for small pow- 
ers T(p*). For example, 


T(p’) = T(p)? _ PRp, 

T(p*) = T(p)’ — 2pRpT(p), 

T(p*) = T(p)* — 3pRpT (p)? + p? Re. 
For a general recursion, see exercise 1.19. 


The Hecke operator T(m) sends a lattice A to the sum of its sublattices 
of index n. We now describe these sublattices more precisely. Let A € £, 


and fix an oriented basis Zw, + Zw for A. For any A’ Cc A, we choose an 
oriented basis w},w, for A’ and write 


wy = aw, + bwo, w = cw, + dwa, 


with integers a,b,c,d. Then one easily checks that 


n=[Asa}=aet (2 4 = ad — be. 


Here’s a quick geometric proof of this fact. The linear transformation a = 


i 7 acting on the vector space R? © Rw, +Rw, = C sends a fundamental 


parallelogram D for C/A to a fundamental parallelogram for C/A’. Hence 


Area of aD 
: i _ 
[A: A’] = ia ae det (a). 


Conversely, if ad — bc = n, then 
A’ = Z( aw + bw) + Z(cwy + dw) 


is a sublattice of A of index n. We thus obtain a map 


{a € M2(Z) : det(a) =n} —> {A’: A’ e A} 
a= e i) r> a(A) = Z(aw, + bw2) + Z(cw; + dw). 


(Here M2(Z) is the ring of 2 x 2 matrices with integral coefficients.) Note 
that a(A) depends on the choice of basis for A, although our notation does 
not reflect this dependence. 
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It is possible, of course, for different a’s to give the same sublattice. 
According to (1.2), we have a(A) = a’(A) if and only if a = ya’ for 
some y € SL2(Z). Note that the basis for a(A) will be oriented if the basis 
for A is, since (1.1) gives 


(= + =) _ (det a) Im(w1/w2) 
cw, + dw2 |c(w /w) + d\” , 


and det(a@) = n > 1. This proves the first half of the next lemma, which 
we state after setting some notation. 


Notation. Let n > 1 be an integer. We define 


D,={(° 3 € M,(Z) : ad~ be =n}, 


Note that 8, is a finite subset of D,, having order 


d-1 
#8n = > S21 =0i(n). 


d|n b=0 
Note also that SL2(Z) acts on D,, via multiplication: if y € SL2(Z) and a € 
Dy, then det(ya) =n, so ya € Dy. 


Lemma 9.3. Let A € £ bea lattice given with a fixed oriented basis A = 
Zw + Zw. 
(a) There is a one-to-one correspondence 


SLo(Z)\Dn 24 {a’ DATE a} 


ns, (2 4) —? a(A) > Z(aw, + bw) + Z( cw + dw). 
(b) The natural inclusion 8,, C D, induces a one-to-one correspondence 


Sp <> SLo(Z)\Dn- 


PRooF. (a) This was proven during the discussion above. 
(b) Let a = ( i) € Dy. We construct a y € SL2(Z) such that ya € 


8,. Suppose first that c # 0. Write the fraction —a/c in lowest terms, 
say —a/c =s/r. Since r and s are relatively prime, we can find integers p 
and qg so that ps — gr = 1. Then 


(FS) a) Gs) mm (Faas 
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so we are reduced to the case that c = 0. Replacing a by —a if necessary, we 
may also assume that a,d > 0. Finally, for an appropriate choice of t € Z, 


the matrix 
1 ¢ a b\_ fa b+td 
0 1 0d) \O d 


satisfies 0 < b+td < d, so it is in 8,. This proves that 8, surjects 
onto SL2(Z)\Dn. 

Suppose now that a,a’ € 8, have the same image. Thus there is a 
7 = (2 9) in SL2(Z) such that 


a b\ (pq a’ W\ fap b'p+d'q 

0d} \r s 0d) \adr Ur+d's)- 
Since a’ # 0, the lower left-hand entry gives r = 0. Next, comparing 
diagonal entries, we find 


=1, a,d,a’,d’ > 0. 


a=a'p, d=d's, ps 


It follows that p = s = 1, and soa =a’ andd=d’. Finally, we have 
b=b'+d’q and (by assumption) 0<b,b' <d =d. 


Hence |d’g| = |b — b’| < d’, from which we conclude that g = 0 and b = b’. 
Therefore a = a’. oO 


Proposition 9.4. Let A € & be a lattice, and let A = Zw; + Zwo be an 
oriented basis for A. Then the Hecke operator T(n) is given explicitly by 
the formulas 


T(n)A= > (Z(aw; + bwe) + Zdw2) = S> (a(A)). 
ad=n,a>1 acsn 
0<b<d 


(The notation a(A) is as in (9.3a).) 


ProoFr. Immediate from (9.3), which says that the sublattices of A of 
index n are precisely the lattices a(A) with a € 8. oO 


Example 9.4.1. For primes p, (9.4) gives the formula 


p-l 
T(p)A = (Zpun + Zw) + $>(Z(wy + bw) + Zpwe). 
b=0 
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§10. Hecke Operators Acting on Modular Forms 


In the last section we described Hecke operators T(n) which assign to a 
lattice A € £ a formal sum of lattices 


Tin)A= >> (A), 
NCA 
and we also gave homothety operators R) defined by Ry(A) = AA. Letting 


F : £ — C be any function on the space of lattices, we define new 
functions T(n)F and R)F on £ in the natural way, 


(T(n)F)(A)= S$ F(A’) and (Ry F)(A) = F(AA). 
ACA 
We would like to define an action of T(n) on the space of modular 
functions f of weight 2k. Unfortunately, a modular function f is not a 


well-defined function on the space of lattices £; it is only a function on the 
space of lattices with given bases: 


A= Zan +Zun— 7 (2). 
We 
However, we can use the fact that f is modular to construct a function 
on £ having a certain homogeneity property, as described in the following 
proposition. 
Proposition 10.1. There is a one-to-one correspondence 
lattice functions F : £4 C 
satisfying F(AA) = \~2* F(A) 
for all \ € C* 
f — ~— F (Zw; + Zw) = wy >* f (w/w), 
fr(7) = F(A) — F. 


weakly modular functions aah, 
f:H—-C of weight 2k 


PROOF. First we check that Fy(A) depends only on A, and not on the 
choice of an (oriented) basis for A. From (1.2a), any other oriented basis 
has the form 


(aw + bwe, cw, + dw2) for some (: D € SLa(Z); 
so 
= aw, + bw 
F 5 ((awy + bw2)Z + (cw + dus2)Z) — (cw + dz) aa (S22) 


2k 
= (cw, + dw)~?* (2 + a) (2) 
we 


= wf (=) _ Fy (Zw + Zu2). 
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Next, it is clear that 
Fy(AA) = F;(ZAw1 + ZAw2) = A7?* Fy (A). 


Similarly, if 7 = (2 i) € SLo(Z), then 


Ay, = (er + d)7!(Z(ar + 6) + Z(er +. d)) = (cr +d)" A, 
which implies that 
fr(qt) = F(A,7r) = F((er+d)71A,) = (ert+d)?*F(A7) = (er +d)** f(r). 


This shows that the indicated maps are well defined. 
Finally, we check that they are inverse to one another, which will prove 
that they give one-to-one correspondences between the indicated sets. 


Ee Ww —2b7 
Fy, (A) = Wo * fr (2) = ‘ Pi Nie) 


ee (2 + z) = F(Zw, + Zw.) = F(A). 
2 


fr, (7) = Fy(A,) = Fy (Zr + Z) = f(r). q 
Using (10.1), we can define Hecke operators on the space of modular 
functions of weight 2k. It turns out to be convenient to multiply by the 
scalar factor n2*-!, which will prevent the appearance of denominators 
in (10.3) below. 
Definition. The n‘" Hecke operator T2,(n) on the space of (weakly) mod- 
ular functions of weight 2k is defined by the formula 


(Tx,(n)f) (r) as n2k-1 » F;(A’) = n2k-} y, Ye (F *) 


n ad=n,a>1 
A’CA, 0<b<d 


Here F;(Zw, + Zw) = wz 7* f(wi/we) is as in (10.1). The equality of the 
last two expressions is immediate from (9.3), which says that 


n 


{A’: A’ CA} = {Z(ar +b) + Zd:ad=n,a>1,0<b< d}. 
Theorem 10.2. Let f be a modular function (respectively modular form, 


respectively cusp form) of weight 2k. Then so is T2,(n)f. 


ProoF. First, we verify that To,(n)f has weight 2k. By definition, T2;,(n)f 
is associated as in (10.1) to the lattice function n?*~!T(n)Fy. The scalar 
factor n?*—! is immaterial, and we have 


(T(n)F)(AA) = ee F(A') = S> F(A’) 
NOMA ACA 
= 798 S* F(A’) = A? (T(n)F)(A). 


MCA 
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Again invoking (10.1), it follows that T(n)F} corresponds to a weakly mod- 
ular function of weight 2k, and so T2;(n)f is weakly modular of weight 2k. 


Next we observe that if f is meromorphic (respectively holomorphic) 
on H, then the formula 


(Tox(n)f)(r) = n?*-? aang (7 4") 


shows that the same is true of T(n)f. 


It remains to check the behavior of Ta;(n) f at co. The next proposition 
gives an explicit formula for the Fourier coefficients of Ta,(n) f, from which 
it follows by inspection (10.3.2) that T2,(n)f is meromorphic (respectively 
holomorphic, respectively zero) at oo if f is. This completes the proof 
of (10.2). oO 


Proposition 10.3. Let f(r) = S>c(m)q™ be a modular function of 
weight 2k. Then the Fourier series for T2,(n)f is 


(Tox (n)f) (T) = ss y(m)q™, where y(m)= > azk—-le (>) ; 


meZ al gcd(m,n) 


As a special case of (10.3), we list the values of y(0), y(1), and y(p) for 
primes p. Notice in particular that y(1) = c(n). Thus in some sense T;(n) 
acts as a shifting operator on the Fourier coefficients of f. 


Corollary 10.3.1. With notation as in (10.3), 
(a) 
(0) = c(0)o24~-1(n) and ¥(1) = e(n). 


(b) For primes p, 


_ J c(pn) +p**"'c(n/p) if pin, 
via) = { er) ao eee epite 


Remark 10.3.2. Notice that if c(m) = 0 form < —mo < 0, then y(m) = 0 
form < —mon. This is clear because y(m) is a sum of terms of the 
form c(mn/a?) with a| ged(m,n), so mn/a? < —mo. Thus T2x(n)f will be 
meromorphic (respectively holomorphic, respectively zero) at oo if f is. 


§10. Hecke Operators Acting on Modular Forms 77 


PROOF (of Proposition 10.3). We use the formula defining To;,(n)f and 
compute 


(Tox (n)f)(7) = n2k-1 > dF (= vt) 


ad=n,a>1 
O0<b<d 
= n2k-l >: q-2k »: eae ernie 
ad=n,a>1 meZ 
0<b<d 
= n2k-1 5 >: e(mi)d- Fettimer/4 5 e2timb/d_ 
mé€Z ad=n,a>1 O0<b<d 


The innermost sum is 


S e2timb/d = i if d|m, 
exe 0 ifdtm. 


Replacing m by md = mn/a and using n/d = a, we find 


(Tox (n)f) (T) = > oD a2k-} erimar, (TM) 


mé€Z ad=n,a>1 
and collecting equal powers of g = e277 (let M = ma) yields 


Mn . 
aS > aa : aoe 
M€EZ al gcd(M,n) @ g 


Suppose that a modular function f is an eigenfunction for the Hecke 
operator T>;,(n). This means that there is a constant A(n) € C so that 


(Ton (n)f) (7) = A(n)f (7) for all 7 EH. 


Using (10.3) to compare the Fourier coefficients of A(n)f and To,(n)f, it 
is clear that the eigenvalue A(n) is related in some way to the Fourier 
coefficients of f. 

Of particular importance are those modular forms which are simul- 
taneous eigenfunctions for every T2,(n). Although it may seem unlikely, 
a priori, that there are any such functions, we will later observe (10.9) that 
in fact MQ, has a basis of such functions. In any case, we can already 
construct the following examples. 


Example 10.4. The modular discriminant A(r) is an eigenfunction for 
every Hecke operator Ti2(n), n > 1. To see this, note that (10.2) says 
that T)2(n)A is also a cusp form of weight 12. But from (3.10.2) the space 
of weight 12 cusp forms M! a has dimension 1. It follows that Ty2(n)A is a 
constant multiple of A. 

Similarly, G4(7) and Ge(7) are eigenfunctions for T4(n) and Te(n) 
respectively, since the spaces M4, and Mg have dimension 1 (3.10.1). In 
fact, it is not hard to show that G,(7T) is an eigenfunction for T,(n) for 
all k >2 and n> 1. See exercise 1.25. 
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We now describe the relationship between the eigenvalues and the 
Fourier coefficients of simultaneous eigenfunctions. 


Theorem 10.5. Let f(r) = S>c(m)q™ 40 be a cusp form of weight 2k, 
and suppose that f is an eigenfunction for all Hecke operators To,(n), say 


Tax (n)f = A(n)f. 
Then 
c(1) £0, and c(n) = XA(n)c(1) for alln > 1. 


PROOF. Comparing the leading coefficient in 


A(n)f =A(n)eA)qt+--+ and Toz(n)f =c(n)qt+---, 


(see (10.3.1b)), we find that c(n) = A(n)c(1). This proves the second part 
of the theorem. 


Suppose now that c(1) = 0. Then what we have proven implies 
that c(n) = A(n)c(1) = 0 for all n > 1, so f = 0. This contradicts 
our original assumption that f #0. Hence c(1) 4 0. Oo 


Definition. A simultaneous eigenfunction as in (10.5) is called normalized 
if c(1) = 1. In view of (10.5), every simultaneous eigenfunction is a constant 
multiple of a normalized eigenfunction. 


In the last section we proved several identities (9.1) for Hecke oper- 
ators T(n) acting on the space of lattices £. These give us the following 
identities for the action of Hecke operators on modular functions, which in 
turn give us relations on the Fourier coefficients of simultaneous eigenfunc- 
tions. 


Proposition 10.6. Let f be a (weakly) modular function of weight 2k. 
(a) Top(mn) f = Ton (m)To4,(n) f for all m,n € Z with ged(m,n) = 1. 


(b)  Tox(p®)Tan(p) f = Tar (p°*') f + p** Tox (p°") f 
for all primes p and alle > 1. 


PROOF. (a) This is immediate from (9.1c). 
(b) We apply the identity (9.1d) to the lattice function F’ described 
in (10.1). Since 

RyFy(A) = Fy(AA) = AP*Fj(A), 


we find 
T(p°)T(p) Fy = T(p°t!) Fy + p77 (po!) F 5. 
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By definition, T2,(n) f = n?*-!T(n) Fy, so multiplying by p&+)(*—) gives 
the desired result. Oo 


Corollary 10.6.1. Let f(7) = So c(n)q" 4 0 be a cusp form of weight 2k 
that is a normalized eigenfunction for every Hecke operator T2,(n). 


(a) e(mn) = c(m)c(n) for all m,n € Z with ged(m,n) = 1. 


(b) c(p*)c(p) = c(p*t!) + p**-*e(p°-!) for all primes p and e > 1. 


PROOF. (a) We combine (10.5) and (10.6a) to find (note that c(1) = 1) 


c(mn) f = A(mn)f = Toe(mn) f 
= Tzi,(m)T2(n) f = A(m)A(n) f = e(m)e(n)f. 


(b) This follows similarly from (10.5) and (10.6b). Oo 
Example 10.7. Let 7(n) be Ramanujan’s r-function defined by 
(20) PA(r) = So r(n)a” =a TT - 9"). 
n>1 n>1 

(See (7.4a) and (8.1).) Then (10.4) says that > r(n)q” is a normalized 
simultaneous eigenfunction, so (10.6.1) gives the relations 

T(mn) = T(m)r(n) for all m,n € Z with gcd(m,n) = 1. 

T(p°)r(p) = t(p*t!) + pr (p*!) for all primes p and all e > 1. 

These identities, conjectured by Ramanujan, were first proven by Mordell. 


Remark 10.8. It is clear from (10.6.1) that modular forms that are si- 
multaneous eigenfunctions have many interesting arithmetical properties. 
(We will see some additional ones in the next section.) We have given 
examples of such functions (e.g., A(7)), but so far we only know finitely 
many such examples. The following theorem of Petersson shows that there 
are many functions to which (10.6.1) applies. We will not give the proof, 
which requires additional machinery involving subgroups of SL2(Z). 


Theorem 10.9. (Petersson [2]) The set 
{f € M$, : f is a normalized eigenfunction for all Tox(n), n > 1} 
is a basis for the space M3, of cusp forms of weight 2k. 


Proor. See Lang [2, Ch. III §4], Ogg [1], Shimura [1, Ch. 3 §§4,5], or 
exercise 1.22. oO 
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§11. L-Series Attached to Modular Forms 


Let f(r) = So e(n)q” 4 0 be a cusp form of weight 2k which is a normalized 
eigenfunction for all Hecke operators T2;,(n). Then the Fourier coefficients 
of f satisfy the identities given in (10.6.1). We now show that these identi- 
ties are equivalent to an Euler product decomposition for a certain Dirichlet 
series attached to f. 


Definition. For any power series 
f => e(n)q” € Cla], 
n>1 
the L-series attached to f is the (formal) Dirichlet series 
L(f,s) = S> e(n)n~*. 
n>1 


Proposition 11.1. Let f = 5° c(n)g” be a power series with c(1) = 1. 
n>1 


Then the coefficients of f satisfy the identities 

(i) c(mn) = e(m)c(n) for all m,n with gcd(m,n) = 1, 

(ii) c(p®)e(p) = c(p*t) + p?*-*e(p*-!) for primes p and e > 1, 

if and only if the associated L-series L( f,s) has the Euler product expansion 


ie 1 
(iii) Lif, s) = II 1—c(p)p=* + pee i328 


(Note that this is an equality of formal Dirichlet series. We have said 


nothing yet about convergence properties. ) 


PROOF. Suppose first that f satisfies (i) and (ii). The multiplicativity 
relation (i) implies that we can decompose L(f,s) into a product over 


primes, 
L(f.s) = So enn = T] See’. 


n>1 p e>0 


If we multiply the inner sum by 1 — c(p)p~* + p?*-1~5, we find 
(1 = e(p)p~* + p™*1-**)(S* e(p")p~™) 


e>0 
= om c(p®)p~ aS S > e(p)e(p*)p™ e+) ft Joep ee es 
e>0 e>0 D0 


= {c(1) + c(p)p~*} — {e(p)c(1)p-*} 
+ 5° (c(p®) — e(p)e(p*~!) + c(p*?)p?***) p-** 


e>2 


=1 using (ii) and c(1) = 1. 
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Hence 


S_c(p®)p-** = : 


~ 7 - 2k~1—-2s? 
0 1 — c(p)p~* + p : 


which proves that L(f,s) has the Euler product expansion (iii). 
We leave the converse as an exercise, since it will not be needed in the 
sequel. See exercise 1.23. oO 


In order to prove that the formal Dirichlet series L(f,s) converges in 
some half-plane Re(s) > so, we need an estimate for the size of the Fourier 
coefficients of f. 


Theorem 11.2. (Hecke) Let f(r) be a cusp form of weight 2k with 
Fourier expansion )> c(n)g". There is a constant %, depending only on f, 
such that 

|e(n)| < «nk for alln > 1. 


Remark 11.2.1. Let f(z) = )¢e(n)g" be a normalized cusp form of 
weight 2k which is a simultaneous eigenfunction for all Hecke operators 
T2,(n). Then the Fourier coefficients of f actually satisfy the stronger 
estimate 

|e(n)| < ao(n)n*-2, 


where oo(n) is the number of positive divisors of n. This is the generalized 
Ramanujan conjecture (for ['(1)), which was proven by Deligne [1,2] as a 
consequence of his proof of the Riemann hypothesis for varieties over finite 
fields. 


Remark 11.2.2. If f(7) is a modular form of weight 2k which is not a 
cusp form, then the Fourier coefficients of f grow at the faster rate 


Kynrk-l < |e(n)| < Kan?*-?. 
See exercise 1.24. 


ProoF (of Theorem 11.2). For any y > 0, we can extract the n** Fourier 
coefficient of f by integrating 


1 
c(n) = ih e 2rin(z+ey) F(a + iy) dex. 
0 


Hence 
|e(n)| < e?7™ sup |f(x + iy)|. 
0<z<1 


Next consider the (non-negative) real-valued function 


(7) = [F(r)|(Im7)*. 
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Using (1.1) and the fact that f is modular of weight 2k, we see that 
o(yT) = (7) for all y € T'(1). 


Hence 
sup $(T) = sup ¢(T), 
TEH TES 


where ¥ is the usual fundamental domain (1.5) for '(1)\H. Further, ¢ is 
continuous on ¥, and 


(Im r)* = 0. 


¢ _ tk 2rint 
Basel) ans) ae 


Note the importance of knowing that f is a cusp form, since if c(0) 4 0, 
the limit would not exist. It follows that ¢ is bounded on F, and so it is 
bounded on all of H. 
Let 
C =sup¢(r). 


TEH 
Then 


|f(a + ty)| = d(a+iy)y* <Cy* — for alla +iy EH. 
Substituting this estimate for f into the above inequality for |\e(n)| yields 
|e(n)| < Cy el, 


This inequality is valid for all y > 0. In particular, putting y = 1/n gives 
the desired result. oO 


Corollary 11.3. Let f be a cusp form of weight 2k. Then the as- 
sociated L-series L(f,s) converges to give a holomorphic function in the 
half-plane 

Re(s) >k+1. 


PROOF. From (11.2) we have 
\e(n)n~*| = |e(n)|n— Re(s) < ggnk—Re(s) | 
Hence >> c(n)n~§ is absolutely convergent provided Re(s) > k +1. oO 


Our next goal is to show that the L-series L(f,s) attached to a cusp 
form f has an analytic continuation to all of C and that it satisfies a 
functional equation similar to the functional equation satisfied by the Rie- 
mann ¢-function. 
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Theorem 11.4. (Hecke) Let f(7) be a cusp form of weight 2k. 
(a) L(f,s) has an analytic continuation to all of C. 
(b) Let 

R(f, 8) = (20) °P(s) LF, 8), 


where I'(s) is the usual T-function. Then 


R(f,2k—s) =(-1)*R(f,s) for alls EC. 


PRooF. The I-function is given by the integral 
I(s) = i. tole dt for Re(s) > 0. 
0 


(For basic facts about [, see Ahlfors [1].) Replacing t by 2ant in the 
integral, we obtain the useful formula 


ne= nr (sy | poet ae 
0 


Write f(7) = )oe(n)q". Multiplying our formula for n~* by c(n) and 
summing over all n > 1 gives 


L(f,s) = > c(n)n~F = S- {o(n)2n)°T(s)™ I pte eam ar} 


n>1 n>1 


= (On) L(sy* ie po Ss} e(nje~?"™ dt 


n>1 


= (2n)°T(s)7} 7 tf (it) dt. 


Note that since |c(n)| < «n* from (11.2), the quantity 


Len f tote 27nt de 
0 


n>1 


is absolutely convergent for Re(s) > k +1, so it is permissible for us to 
reverse the order of the sum and the integral. 

We split the above integral for L(f,s) into two parts. For large t the 
integral will converge for all s € C. For small t we replace t by 1/t and use 
the fact that f satisfies 


F(z) = s(8t9) = a0. 
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Thus 


(27) ~*T'(s)L(f, 8) = [ t*-1f(it)dt from above, 


1 [oe] 
zs | #1 f(it) dé + / #1 f(it) dt 
1 


0 
1 s—l . [oe] 
is 1 v7 1 s—l py: 
“LG) Gi) aG)+ [Cesena 
fo.) OO 
=| (—1)*t?4-$—! fit) dt +f t®—1 f (it) dt. 
1 1 
This gives us the integral representation 
LC.) = (Bmy*T(e)t fe + (-1yhPho1} pt), 
1 
valid a priori for Re(s) > k +1. 
But ['(s)~! is holomorphic on C, and by inspection the integral is 
absolutely and uniformly convergent for s in any compact subset of C. 
(Note that since f is a cusp form, |f(it)| goes to 0 like a multiple of e~?7* 


as t + oo.) Hence this integral gives the analytic continuation of L(f, s) 
to C. Finally, we observe that the expression 


e(s,t) = t9-1+(—1)*t?*-8-! satisfies (2k — s,t) = (—1)*e(s, t). 
It follows immediately that 
R(fys) = (2m) "T(s)L(f,3) = f e(sst) fit) dt 
1 
has the same functional equation, R(f,2k — s) = (—1)*R(f,s). oO 


We record as a corollary the useful integral expression for L(f,s) de- 
rived during the course of proving (11.4). 


Corollary 11.4.1. Let f(r) be a cusp form of weight 2k. Then 


L(f, s) = (27)*T(s)7! [ee + (~1)*t?*-*-1} $ (it) dt. 


EXERCISES 


1.1. Prove that the modular group ['(1) is the free product of its subgroups (S) 
and (ST) of orders 2 and 3. 
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1.2. 


1.3. 


1.4. 


1.5. 


Let 7o € Q, and let y € I'(1) satisfy yoo = 7). Prove that 7 sends the set 
{7 €H: Im(r) > &} 


to the interior of a circle in H which is tangent to the real axis at 79. Prove 
that the radius of the circle goes to 0 as k — ov. 


Give an example of a Hausdorff space X and a topological group I’ acting 
continuously on X such that the quotient space [\X, taken with the quo- 
tient topology, is not Hausdorff. (By definition, the action of [ on X is 
continuous if the map T x X > X, (7,2) +> ya, is continuous.) 

For any a € C, let ga(t) = (7 — a) /(7 — @). 

(a) Prove directly that 


gi(St)=—gi(t) and — gp(STT) = p*go(r). 


(As usual, i = e™’/? and p = e?7*/3,) 


(b) Find the largest disk U C H centered at 7 such that the map 
{1,S}\U SC, re > gilt)? 


is injective. Compute its image and its inverse. 
(c) Same as (b) for U centered at p with 


{1, ST,(ST)?}\U —C, TR> g(t)’. 


Let 7 € H be a point satisfying a quadratic equation 


r?—ar+b, a,beZ, a’ — 4b <0. 

Suppose further that Z[r] is the ring of integers of the quadratic imaginary 
field Q(r). 

(a) Prove that the fractional ideals of Q(r) are in one-to-one correspon- 
dence with the lattices L contained in Q(r) which satisfy 7L C L. (In this 
context, a lattice is a free Z-module of rank 2.) 

(b) Prove that every ideal class is represented by a fractional ideal of the 
form 


L+T 


Zt+ Z 


satisfying the following conditions: 

(i) z,y€Z with y > 0, 

(ii) 4y? — (4b — a?) < (Q2 +a)? < y’, 

(iii) yl? + ax +b. 
Conclude that the class number of Q(r) is finite. 
(c) Prove that the ideal classes in (b) are distinct provided that we discard 
all pairs (x, y) satisfying either of the following conditions: 

(iv) 2x +a=-y, 

(v) a +ar+b=y? with 2r+a<0. 
(d) Use the above algorithm to compute the class number of the following 
quadratic fields: 


Q(V—-3), Q(V—3), Q(V=23), O(Vv—29), Q(V—47). 
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1.6. 


ae 
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(a) Prove that the natural reduction map 
SL2(Z) —> SL2 (Z/NZ) 
is surjective. 


(b) Define [(V) to be the subgroup of I'(1) consisting of matrices congru- 
ent to 1 (mod N), that is, 


_ a b _a=d=1(mod N) 
ra {(t i) pea ao 


Prove that 


6 if N = 2, 
[PQ) : TWN)] = IN* T[(1-p-?) if N>3. 
p|N 


(c) Prove that P(N) is a normal subgroup of ['(1) and that 
T(1)/T(N) & Sle (Z/NZ) /{+1}. 


Define subgroups [o(.NV) and Pi(N) of P(1) by 


To(N) = {(¢ .) €T(1):¢=0(mod wh, 


rom={(¢ ) €PC) sa d= 1 (mod N), = 0(aod Ny, 


(a) Prove that [y(N) is a normal subgroup of I'o(V), and show that 


To(N)/Ti(N) © (Z/NZ)* /{+1}. 


(b) Prove that 


Ti(N)/C(N) = Z/NZ_ (taken additively). 
(c) Prove the following two formulas: 


[F(a) : To(N)] = N [Ja +p"). 
pIN 
3 if N = 2, 
re riad= | ga a if N > 3. 


p\N 
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1.8. 


1.9. 


1.10. 


Let X/C be a smooth projective curve of genus g. For any divisor D = 
ys nz(x) with real coefficients nz € R, let 


[D] = $—[nz](x) € Div(X) 


rex 


be the integer part of D, where [nz] denotes the greatest integer in nz. 
Also let 


£(D) = {f € C(X)" : div(f) = —D}U {0}. 


(a) Prove that £(D) & £([D]). 
(b) Let k > 1 be an integer, and let Kx be a canonical divisor on X. 
Prove that 


{w € OK : div(w) > —D} © L(kKx + [D]). 
Let ¢: H* — X(1) be the usual projection, and let 


Do = 5(9(2)) + 3((p)) + (#(00)) € Div(X(1)) @Q. 


(a) Prove that the map 
Moar — {we (1) : div(w) > —kDo}, frours 


is an isomorphism. Here wy is the k-form described in (3.7a) having the 
property ¢*wy = f(r) (dr)*. 
(b) Conclude that Max =~ £(kKx(1) + [kDo]). Use the Riemann-Roch 
theorem [AEC II.5.4] to calculate the dimension of M2,, thereby giving an 
alternative proof of (3.10c). 


(a) Prove that the set 
{GG :a,b€ Z, a,b > 0, 2a+ 3b = k} 
is a basis for Max. 
(b) Conclude that the map 
CIX,Y] + CM, P(X, Y) > P(Gs, Ge), 
k=0 
is an isomorphism of graded C-algebras, where we grade C[X, Y] by assign- 


ing weights wt(X) = 2 and wt(Y) = 3. In particular, the functions Ga(r) 
and G(r) are algebraically independent over C. 
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1.11. This exercise outlines an elementary proof that the modular j-invariant 
defines a bijectivemap 


j:TQ)\\H— C. 


Fix some jo € C, let H be a large real number, and let F(H) C H be the 
region bounded by the curves 


|r] =1, Re(r)=4, Re(r)=—4, Im(7r) =H. 
Let 0F(H) be the boundary of ¥(H), which we take with a counter- 
clockwise orientation. Assume for now that j(7) # jo for all 7 € OF(H). 


(a) Prove that 


EBA) Aah ‘| FAG) fe 


Qni a5(H) j(T) — jo 
(b) Prove that 


fF 
Hoo Qmt Jagr) j(T) — jo 

(Hint. Use (7) = j(r +1) = 9(—1/7) to cancel out most of the line integral, 
and use j(T) = q~!+(power series in q) to evaluate the remaining piece.) 
(c) Conclude that 7(F) = C and that 7 is injective on the interior of F. 
(d) If 7(70) = jo for some 7 € OF, use a slightly modified region to show 
that 7 is still injective. Conclude that j7 maps the quotient ['(1)\H bijec- 
tively to C. 

(e) Use the bijectivity from (d) to prove the Uniformization Theorem (4.3). 


1.12. Let A, A’ C C be lattices satisfying 
Ga(A) = Ga(A’) and Ge(A) = Ge(A’). 
Prove that A = A’. 


1.13. *Let A = Zw, + Zw be a lattice given with an oriented basis, and let 77 : 
A — C be the associated quasi-period map. Prove that 


(w1) 1 (wo) _ 1 
oie 2 DD (mw + nw)?’ Ta ~ ye Ss (mw + nw)?” 


neE€Z mEeZ méZ neZ 
nZ#0 m0 m#0 n40 


N.B. These double series are not absolutely convergent; the order of sum- 


mation really does matter. 
1.14. (a) Provethat 


[[ (+A) - (2 +4;A)) 


uve pA/A 2 2 
ie (2N?~3)(N?—1) 
% =+N"'*p'(Nz;A)N A(A) 12 


(b) Prove that 
(N?—1)(N?-3) 
I] (A) - ea) =n" May 


U,vEePA/A 
uZXtv (mod A) 
u,v#0 (mod A) 
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1.15. Let E/C be the elliptic curve associated to the oriented lattice A = Zw. + 
Zw. Recall the Weil pairing 


em : Elm] x E[m] — pm 
defined in [AEC III 88]. Prove that on 

E{m] =m7'A/A C C/A, 
the Weil pairing is given by the formula 


‘ (= + bwe cwi + “) — 2m i(ad—be)/m 
7 m ? m ‘ 
(Hint. Use (5.5) to write the elliptic functions appearing in the definition 
of €m as products of o functions.) 
1.16. Let s(x, y) be the Dedekind sum defined in §8. 
(a) Prove that 
(y= Dy ~2) 


s(1,y) = Dy 


(b) Derive a similar formula for s(2,y). (The answer will depend on the 
parity of y.) 
(c) Prove that s(y? + 1,y) = 0 for all integers y > 0. 

1.17. Let y= G a € SL2(Z) with c > 0, and let ® be as in (8.4) and (8.5). 


(a) Prove that ®(yT) = ®(y) +1. 
(b) * Prove that ®(7S) = ®(y) — 3 provided that d > 0. (Hint. Use the 
definition of ® (8.5) and (8.3a) to show that 


2( (7S) -— (9) = log(cSr + d) + log(r) — log(dr — c) — e. 


Now evaluate at T = 7.) 
(c) Use (b) and (8.5) to deduce Dedekind’s reciprocity law (8.6), 


zy 1 
12s(z, 12s(y,z) = -—+=+4+— —-3. 
s(a,y) + 12s(y.0) =F + 2 + 


1.18. Let 


Po =]Ja-a"), P, =]Ja-«¢"?), 


n>1 n>1 


P, =TJa+a"), Ps = [Ja+q"?). 


n>1 n>1 
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(a) Prove that 


{e(5)- o()} = Pare, 
{6(5) - (Sy = PyP?, 
{o (+) -0 (5) } = 20h ert. 


(Hint. Use (5.6a) and the product expansion (6.4) of o.) 
(b) Prove that 


P,P2P3 = 1. 


(Hint. It’s easier to show that PoP; P2P3 = Po.) 
(c) Use (a) and (b) to prove Jacobi’s formula 


A(r) = (2m)?q [a - 4"). 


n>1 


1.19. Verify the following identities for the Hecke and homothety operators acting 
as correspondences on the space of lattices £. (Note that there are similar 
identities for the operators T2;,(n) acting on the space of modular forms of 
weight 2k which will differ from these identities by various scalar factors.) 


6 ms ELCs) 
(b) T(p’)T(p*) = So piRire") forO<r<s. 
i=0 
(c) T(m)T(n)= S_  dRaT (=) 
d| gcd(m,n) 


1.20. (a) Let f(7) be a modular function of weight 2k. Prove that 


2 2 
g = (2k4 » (4) akg SS 


is a modular function of weight 4k + 4. 
(b) If f is a modular form, prove that g is a cusp form. 
(c) If f is the Eisenstein series Ga(r), prove that 


1 
~ 24335272 


g A(r). 


Similarly, if f = Ge(r), prove that g = cGa(r)A(r), and find the value of 
the constant c. 
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1.21. 


1.22. 


For any matrix a = @ a) with real coefficients and det(a) > 0, define 


p(a,t) =cr+d. 
For any function f : H — C, define a new function f|[a]an by 
(f\lolex) (7) = (det a)*u(a,7)~** f(a). 
(a) Prove that f is weakly modular of weight 2k if and only if 


fllvlee =f for all y € SL2(Z). 


(b) Prove that 


WF OA = WFllalon> 


where wy is the differential form described in (3.7). 
(c) Verify the identities 


w(aB,7) = pla, Br)u(B,7) and (fl[elax) |[Glax = flleBlex- 


(d) Prove that the action of T2,(n) on a weight 2k modular function f is 
given by the formula 


Tox(n)f = n?*-1 S> f\la]ax- 


a@€SL2(Z)\Dn 


(See §9 for the definition of Dy.) 


Let f,g € M3,. The Petersson inner product of f and g is defined by the 
integral Abs 
dt \ dr 


(hoa) = f senator) SER, 


(Here F is the usual fundamental domain for ['(1)\H. See (1.5).) 

(a) Prove that the integral converges. (Note that f and g are assumed to 
be cusp forms.) 

(b) Prove that ( , ) is a positive definite Hermitian inner product on the 
complex vector space M$,,. 

(c) Let w(f,g) be the integrand 


(fra) = Fra Vim 7) SNe 


Prove that for any matrix a with real coefficients and det(a) > 0, and any 
functions f,g on H, 


w(f,g) oa =w (f|[a]ax, g|[alex) - 
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1.23. 


1.24. 


1.25. 


1.26. 
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In particular, if f,g € Moz and y € SL2(Z), then 


w(f,g)oy = w(f,g). 


(See exercise 1.21 for the notation f|[a]2x.) 
(d) * Prove that T2,(n) is self-adjoint with respect to the Petersson inner 
product: 


(Tor(n)f, 9) = (f, Tox(n)g) for all f,g € M8,,n>1. 


(e) If f,g € Ms), are normalized eigenfunctions for every T2,(n), prove 
that either 
(f.g)=0 or fxg. 


(f) Prove that 
{fe M8, : f is a normalized eigenfunction for all Ta (n), n > 1} 


is a basis for M$,,. 

Prove that if L(f,s) has an Euler product expansion as in (iii) of (11.1), 
then the coefficients of f satisfy the identities (i) and (ii) of (11.1). 

(a) Let Gox(r) = >> c(n)q” be the Fourier expansion of the Eisenstein 
series G2,. Prove that there are constants k1, k2 > 0, depending only on k, 
such that 


min?®-! < |e(n)| < Ken?! for alln > 1. 


(b) Let f(r) = }>e(n)q” be a modular form of weight 2k which is not 
a cusp form (i.e. c(0) # 0). Prove that there are constants k1,K2 > 0, 
depending on f, such that 


Kin?*-) < |e(n)| < Kon?*—! for all n > 1. 


(a) Prove that the normalized Eisenstein series Ex is a normalized eigen- 
function for every Hecke operator T,(n). See (7.3.1) for the definition 
of Fox. 

(b) Let f € M2zx be a modular form of weight 2k > 4 which is not a cusp 
form, and suppose that f is a normalized eigenfunction for every Hecke 
operator T2,(n). Prove that f = Ex. 

Let f € Mo, be a modular form of weight 2k > 4 which is not a cusp 
form, say f has the Fourier expansion f = c(0) + c(1)q+--- with c(0) 4 0. 
Let L(f,s) be the L-series attached to f as described in 811. 

(a) Prove that L(f,s) can be analytically continued to C \ {2k} and that 
it has a simple pole at s = 2k with residue 


= ar) 2h 
reSs=2k L(f,s) = 


(b) Let R(f,s) = (27)~*T'(s)L(f, 8). Prove that L(f,s) satisfies the func- 
tional equation R(f,2k — s) = (-1)*R(f,s). 
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1.27. 


1.28. 


Let f(r) be a cusp form of weight 2k with k an even integer. 
(a) Prove that 


L(f,k) = (k z D! DB Orc, Qrn), 


where I'(s, 2) is the incomplete P’-function 
CO 
I'(s,x) = / to~1e-* dt forsE€C,a2>0. 


(b) Prove that 


k 
Ef) = 22m) Sole" > aga 
m=1 


n>1 
(Note that this series converges quite rapidly, since from (11.2), |c(n)| grows 
no faster than n*.) 
Let f(r) = D5 c(n)q” be a cusp form of weight 2k, let p be a prime, and let 
x : (Z/pZ)" — C* 


be a primitive Dirichlet character and extend x to Z be setting x(p) = 0. 
The Gauss sum g(x) associated to x is given by the formula 


p-1 
a(x) = Dy xen’? 
b=0 
We define the the twist of f by x to be the function 


fat) = Yo e(n)x(nJa”. 
n=1 
(As usual, we set x(n) = 0 if gcd(p,n) > 1.) We will denote the associated 
twisted L-series by 


L(f,x.8) = L(F (x, -)18) = Do eln)x(n)n™*. 
(a) Prove that 


x(n) = Zax) 9 xayemen’?, 
a=0 
(b) Let R(f,x,s) be the function 
_ (P.\*° Ts) 
R(f,x;8) a (2) Ix) 9): 


Prove that R has the integral representation 
e s f(x; it) k 2k-s f(x: it) dt 
Rifsx.3) = f { cy carat ce Ga? 0 L621: Sma aat daa 
7 g(x) Ge) Wx) Jt 
(c) Prove that L(f,x,s) has an analytic continuation to all of C and that 
it satisfies the functional equation 


R(f,x,8) = (-1)*x(-1) R(X, 2k — 8). 


94 


I. Elliptic and Modular Functions 


1.29. Let a1,a2,... be a sequence of complex numbers, and suppose that there 


is a constant c > 0 such that |an| < n° for all n. Let A > 0 be a constant 
and k > 0 an integer, and define functions 


(3) = Draan*, (8) = (24) “1(s)6(9), Fr) = Dane. 


n>1 n>1 


(a) Prove that ¢(s) is absolutely convergent provided Re(s) is sufficiently 
large. 
(b) Prove that f(7) is holomorphic on H. 
(c) *Prove that the following two facts are equivalent: 
(1) ®(s) has an analytic continuation to all of C, is bounded on every 
vertical strip, and satisfies the functional equation 


@(k — s) = £H(s). 


(A vertical strip is a region of the form ci < Re(s) < ce.) 
(ID f(r) satisfies the functional equation 


CHAPTER II 


Complex Multiplication 


Most elliptic curves over C have only the multiplication-by-m endomor- 
phisms. An elliptic curve that possesses extra endomorphisms is said to 
have complex multiplication, or CM for short. Such curves have many spe- 
cial properties. For example, the endomorphism ring of a CM curve F is 
an order in a quadratic imaginary field K, and the j-invariant and tor- 
sion points of F generate abelian extensions of kK. This is analogous to the 
way in which the torsion points of G,,(C) = C* generate abelian extensions 
of Q. An important result in the cyclotomic theory is the Kronecker-Weber 
Theorem, which says that every abelian extension of Q is contained in a 
cyclotomic extension. We will prove corresponding results for a quadratic 
imaginary field kK. For example, we will show how to construct an elliptic 
curve E such that K(j(£)) is the Hilbert class field of K, and we will 
explain how to use the torsion points of £ to generate the maximal abelian 
extension of K. 

We have generally not tried to assign credit for the results described 
in this chapter but will content ourselves with mentioning Kronecker, We- 
ber, Fricke, Hasse, Deuring, and Shimura, who are largely responsible for 
that part of the theory of complex multiplication that we will cover. In 
particular, the algebraic proofs in §§4 and 5 are essentially due to Deuring, 
and the idelic description of complex multiplication in §§8 and 9 is mainly 
due to Shimura. 

The material included in this chapter barely scratches the surface of 
the theory of complex multiplication; a complete treatment of even the 
basics would fill (at least) an entire volume. The reader desiring further 
information might profitably consult the following sources, as well as the 
references they contain. We must especially acknowledge Lang [1], Serre [6], 
and Shimura [1], whose expositions strongly influenced our organization of 
this chapter. 


Borel et al. [1]: A development of the basic theory of CM using an analytic 
approach, together with some useful computational methods. 

Cassou-Nogués-Taylor [1]: The basic theory of CM is developed in the first 
few chapters, followed by the use of CM to generate rings of integers. 
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Coates [1]: The basic theory of CM is described, followed by an introduc- 
tion to the Iwasawa theory of CM elliptic curves. 

Lang [1]: Part II develops the theory of CM much as we do, with addi- 
tional material on the arithmetic properties of special values of elliptic 
and modular functions. 

Perrin-Riou [1]: Iwasawa theory for CM elliptic curves. 

Serre [6]: A very brief, but beautifully written, summary of the main 
theorems of CM for elliptic curves. 

Shimura [1]: The idelic formulation of CM for elliptic curves is covered in 
Chapter 5 and is extended to abelian varieties in §§85.5 and 7.8. For a 
more complete treatment of the theory of complex multiplication on 
abelian varieties, see Shimura-Taniyama [1]. 

Vladut [1]: A nice historical account of Kronecker’s Jugendtraum, the 
theory of complex multiplication, and the relationship with the theory 
of modular forms. 


The main prerequisite for this chapter is some familiarity with the basic 
theorems of class field theory. We have provided in §3 a resumé (without 
proof) of the results we will need. We also assume that the reader is familiar 
with basic properties of elliptic curves over the complex numbers. 


§1. Complex Multiplication over C 


In this section we are going to discuss elliptic curves with complex multi- 
plication from the viewpoint of complex analysis. Although interesting in 
its own right, this should be viewed mainly as the preparation needed to 
study arithmetic questions. 

Let E/C be an elliptic curve with complex multiplication. We know 
from [AEC VI.5.5] that End(£) @Q is isomorphic to a quadratic imaginary 
field and that End(£) is an order in that field. If End(#) © RCC 
and kK = R®Q, then we will say that that “E has complex multiplication 
by R” or that “E has complex multiplication by K.” We also let 


Rx = ring of integers (maximal order) of K. 


Much of the theory becomes easier if one restricts attention to elliptic curves 
with complex multiplication by Rx, so we will usually take this course. For 
the general theory, see Lang [1] or Shimura [1, Ch. 5]. 

The uniformization theorem for elliptic curves [AEC VI.5.1] says that 
for every elliptic curve E/C there is a lattice A C C and an isomorphism 


fe Ci 2S E(C) 
z > (p(z,A),@'(z,A)). 
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We will denote the elliptic curve corresponding to a lattice A by Ey; it is 
given by the usual Weierstrass equation 


Eq: y? = 4x? — go(A)ax — g3(A). 


If & has complex multiplication, then there are two ways to embed the 
order End(£) into C. It is important to pin down one of these embeddings. 
This is done by the following proposition, which also provides an important 
tool for studying arithmetic properties of various analytically defined maps. 
The reader might compare Proposition 1.1 with [AEC III.5.3], which gives 
the case that a € Z. We will use Proposition 1.1 to make deductions in a 
manner similar to the way we used [AEC III.5.3] to deduce [AEC JII.5.4] 
and [AEC III.5.5]. 


Proposition 1.1. Let E/C be an elliptic curve with complex multipli- 
cation by the ring RC C. There is a unique isomorphism 


[-]: R — End(£) 
such that for any invariant differential w € Ng on E (see [AEC III §5]), 
[a]*w = aw for alla € R. 


We say in this case that the pair (E,[ - ]) is normalized. 


PROOF. Choosing a lattice A and an isomorphism EF = Ej, it suffices to 
prove the proposition for E,. (Note that [AEC III §1, Table 1.2] says 
an isomorphism has the effect of multiplying an invariant differential by a 
constant.) 
Next we recall [AEC VI.5.3] that the endomorphism ring of E, is 
isomorphic to 
{faeC:aAcCAS=RCC. 


More precisely, each a € R gives an endomorphism [a] : Ex, — Ex, deter- 
mined by the commutativity of the following diagram: 


We claim that this map [-] : R —> End(£) satisfies [a]*w = aw. 

To verify our claim, we first note that any two non-zero invariant 
differentials on HE, are scalar multiples of one another. This follows trivially 
from the fact that their quotient would be a translation invariant function, 
hence would be constant. So if we take any invariant differential w € Og 
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and pull back via the isomorphism f : C/A — E,(C), we obtain a multiple 
of the invariant differential dz on C/A, say 


ftw =cdz. 


Now tracing around the commutative diagram shown above gives the de- 
sired result: 


[a]*w = (f~")* 0650 f*(w) 
= (f')* 0b (edz) = (f7")*(cadz) = aw. 
Qo 


Corollary 1.1.1. Let (E1,[-]z,) and (E2,[-]z,) be normalized elliptic 
curves with complex multiplication by R, and let ¢: FE, — Ez be an 
isogeny. Then 


60 fale, =lalz,o¢ for alla€é R. 


PROOF. Let 0 Aw € Op, be an invariant differential. Then 


($0 [a]z,)"w = [a]h, (*w) 
=ag*w since ¢*w is an invariant differential on FE, 
= ¢* aw 
= 6" ([ole.) 
= (a]is, 0 6)" w. 


Every non-zero isogeny E, — E 2 is separable (we’re working in character- 
istic 0), so [AEC IT.4.2c] says that the map 


Hom(£}, E2) caro Hom(Q¢,,Qz,), w od y*, 
is injective. Therefore ¢ 0 [a]z, = [a]p, o ¢. oO 


We have seen in Chapter I that in order to understand particular ellip- 
tic curves, it is often useful to study the set of all elliptic curves. Similarly, 
in order to study a particular elliptic curve with complex multiplication, 
it turns out that one should look at the set of all elliptic curves with the 
same endomorphism ring. Of course, by “elliptic curves” we really mean 
isomorphism classes of elliptic curves, which leads us to define the following 
si {elliptic curves E/C with End(E) © R} 

isomorphism over C 


_ {lattices A with End(£,) = R} 
homothety ‘ 


ELL(R) = 
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If we start with a quadratic imaginary field K, how might we construct 
an elliptic curve with complex multiplication by Rx? Ifa is a non-zero ideal 
of Rx, or more generally if it is a non-zero fractional ideal of K, then using 
the embedding a Cc K C C we see that a is a lattice in C. (This is clear 
from the definition of fractional ideal, which for quadratic imaginary fields 
implies that a is a Z-module of rank 2 which is not contained in R.) Hence 
we can form an elliptic curve E, whose endomorphism ring is 


End(£,) = {ae C: aac a} 
={aeK :aaca} sincea C K 


= RK since a is a fractional ideal. 


Thus each non-zero fractional ideal a of K will give an elliptic curve 
with complex multiplication by Rx. On the other hand, since homothetic 
lattices give isomorphic elliptic curves, we see that a and ca give the same 
elliptic curve in ELL(Rx). This suggests that we look at the group of 
fractional ideals modulo principal ideals, which the reader will recognize as 
one of the fundamental objects of study in algebraic number theory: 


C£L(Rx) = ideal class group of Rx 


_ {non-zero fractional ideals of K } 
~ {non-zero principal ideals of Kk} 


If a is a fractional ideal of AK’, we denote by a its ideal class in CL(Rx). 
We have seen that there is a map 


CL(Re) — ELL (Rx), ar— Fy. 


More generally, if A is any lattice with E, € ELL(Rx) and a is any non- 
zero fractional ideal of K, we can form the product 


aA = {ajAy +--+ + a,A, >a; Ea, Ai € A}. 


We will now prove the elementary, but crucial, fact that this induces a 
simply transitive action of the ideal class group C£L(Rx) on the set of 
elliptic curves ELL(Rx). This proposition forms the basis for all of our 
subsequent work on complex multiplication. 


Proposition 1.2. (a) Let A be a lattice with E, € ELL(Rx), and let a 
and 6 be non-zero fractional ideals of K. 
(i) aA is a lattice in C. 
(ii) The elliptic curve Ea satisfies End(Eaa) = Rx. 
(iii) Ean @ Epa if and only if a=b in CL(Rx). 
Hence there is a well-defined action of CL(Rx) on ELL(RK) determined 


by 
ax Ha = Ey-1,.- 
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(The reason for using a~! instead of a will become apparent below.) 
(b) The action of CL(Rx) on ELL(Rx) described in (a) is simply transi- 
tive. In particular, 


# CL(Rx) = #ELL(Rx). 


PROOF. (a) (i) By assumption, End(E,) = Rx, so RxA = A. Choose a 
non-zero integer d € Z so that da C Rx, which is possible by the definition 


of fractional ideal. Then aA Cc —A, so aA is a discrete subgroup of C. Simi- 
larly, choosing a non-zero integer d so that dRx C a, we find that dA C aA, 


hence aA spans C. This proves that aA is a lattice. 
(ii) For any a € C and any fractional ideal a 4 0, we have 
aaA CaA = > a laaA ca taA =} aACA. 
Hence 
End(£aa) = {a €C : aaA Cc aA} 
={aeEC: adc A}=End(Ea) = Rx. 


(iii) From [AEC VI.4.1.1], the isomorphism class of Faq is exactly deter- 
mined by the homothety class of aA. In other words, Egan & Ea if and 
only if there is ac € C* such that aA = cbA. Multiplying by a7! and using 
the fact that Ry A = A, we see that 

Ean = Exn => A= ca 'bA. 
Similarly, multiplying by c~1b~! gives 


Ean = Eon <=> A=cu'ab'A. 


Hence if Eg, & Epa, then both ca~!b and c~!ab—! take A to itself, so they 
are both contained in Rx, and hence are equal to Rx. Therefore 


a= cb, 
from which we see immediately that c € K and a = 6. This completes the 
proof of (iii). 
Finally, the trivial observation 


a* (b * Ex) =a* Eg-14 = Fig-1(6-1A) = E(ab)-1A = (ab) # Ea 


shows that the definition @* Ey, = E,-1, gives a group action of CL(Rr) 
on ELL(Rx). 
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(b) Let E,, and E,, be two elliptic curves in E€£L(Rx). To show that 
the class group CL(Rx) acts transitively on ELL(Rxe), we must find a 
fractional ideal a with the property a* E,, = E,a,. Choose any non-zero 


1 
element A, € A,, and consider the lattice a; = 7 At From [AEC VI.5.5] 


1 
we see that a; is contained in K, and by assumption it is a finitely gen- 
erated Rx-module, hence it is a fractional ideal of K. Similarly, choosing 


a non-zero Ag € Ag, we obtain a second fractional ideal ag = x he of K. 
2 


Then 


A2 0 
aaceed TAY = Ao. 


So if we let a = ayy, then 
ax Fa, = FEu-1n, = Ex, = Ep,. 
2 


Note the last equality follows from the fact that homothetic lattices give iso- 
morphic elliptic curves. This shows that the action of CL(Rx) on ELL(RK) 
is transitive. 

To prove that the action is simply transitive, we must show that if 
a* Ex, =b* Ey, then a= 6. But this is immediate from part (ii) of (a). 


We have already seen two sorts of elliptic curves which have complex 
multiplication, namely the curves with 7 = 0 and j = 1728 whose auto- 
morphism groups are strictly larger than {+1}. (See [AEC III.10.1].) Now 
we'll look at these curves from a complex analytic viewpoint. 


Example 1.3.1. Let A = Z/i] be the lattice of Gaussian integers. Then 
the endomorphism ring of Ey, is Z[{i]. In particular, Aut(E,) & {+1, +7}, 
so our general theory [AEC III.10.1] tells us that 7(F,) = 1728. But we 
can see this directly in the following way. The lattice A satisfies iA = A. 
Hence 

ga(A) = gs(iA) = i°g3(A) = —g3(A), 


so g3(A) = 0. Therefore E, is given by the Weierstrass equation 
E, : y* = 423 — go(A)z, 


from which we see immediately that j(£,) = 1728. 

Since j(£,) is rational, we know that EF, is isomorphic over C to an 
elliptic curve defined over Q; for example, it is isomorphic to the curve y? = 
xz? +2. But it does not follow that go(A) itself is in Q. In fact, a theorem 
of Hurwitz [1] says that 


4 


g2(Z[i]) = 64 (f a) : 
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Example 1.3.2. Similarly, let p = e?7*/° be a primitive cube root of unity, 


and let A = Z[p] be the associated lattice. Then pA = A, so 
g2(A) = go(pA) = p*g2(A) = pga(A), 
and hence go(A) = 0. Thus EF, is given by the equation 
Eq, :y? = 4x? — g3(A), 
so j(E,) = 0. This confirms [AEC III.10.1], since Aut(E,) = Z[p|* = 


{+1,+p,+p?}. Further, we see that Ey, is C-isomorphic to the curve 
y? = 2° +1, which is defined over Q. 


If E has complex multiplication by K, we will eventually use torsion 
points of & to generate abelian extensions of kK. We could restrict ourselves 
to studying points of order m for various integers m, but because F has 
complex multiplication, there are other natural finite subgroups to look at. 
In general, if a is any integral ideal of Rx, we define 


Ela] ={PeE: [a]P =0 for alla € a}. 
We call Ela] the group of a-torsion points of E. For example, if a =mRx, 
then E[a] is just E[m]. Notice that the definition of E[a] depends on 
choosing a particular isomorphism [-] : Ry — End(E); we always choose 
the normalized isomorphism described in (1.1). 


If a is an integral ideal of Rx, then A C a~!A. This means that there 
is a natural homomorphism 


C/A — C/a'A, Ze Z, 
which in turn induces a natural isogeny 
En — ax Ex. 


The following useful proposition gives a precise description of this isogeny 
and of E[a]. 


Proposition 1.4. Let E € ELL(Rx), and let a be an integral ideal 
of Rr. 

(a) Ela] is the kernel of the natural map E > a* E. 

(b) Ela] is a free Rx /a-module of rank 1. 


Proor. Let A be a lattice corresponding to F. Fixing an analytic isomor- 
phism C/A = E(C), we find that 


Ela] = {z € C/A : az =0 for alla € a} 
={zE€C: azEA forallaca}/A 
={zeC:zacA}/A 
=a 'A/A 
= ker (C/A pay C/a-1A) 
= ker(E > ax E). 
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(b) Continuing with the notation from (a), we choose a non-zero lattice 
element \ € A. Then [AEC VI.5.5] says that the lattice (1/A)A is contained 
in K, and it is a finitely generated Rx-module, so it is a fractional ideal 
of K. Since homothetic lattices give isomorphic elliptic curves, we may 
assume that A is a fractional ideal of Kk. 

From (a) we know that E[a] © a~'A/A as Rx/a-modules. Note that 
if q is any integral ideal dividing a, then the fact that Rx A = A implies 


(a-'A/A) @rx (Rw /q) a7 A/(A+qa7tA) = a7 A/qat A. 
Hence if we use the Chinese Remainder Theorem to write 


Re/a= |] Rx/p, then Ela] = II at A/pe® amt. 


p prime p prime 


So it suffices to prove that if 6 is a fractional ideal of Rx (such as b = a~‘A) 
and if p® is a power of a prime ideal, then b/p°b is a free Rx /p®-module of 
rank one. 

To ease notation, we momentarily write 


Behe /p.. pS pie, and sb’ = b/p*b. 


Notice that R’ is a local ring with maximal ideal p’. (In fact, the only 
ideals in R’ are (0),p’°~',---,p’,(1).) Consider the quotient 


b’/p’b’ = b/pb as a vector space over the field R’/p’ ~ Rx /p. 


We claim that it is a one-dimensional vector space. 

First we observe that any two elements of b are Rx-linearly dependent, 
so the dimension of b/pb over Rx /p is at most one. On the other hand, 
if the dimension were zero, then we would have 6 = pb, which is absurd. 
Hence the dimension is one. By Nakayama’s lemma (Atiyah-MacDonald [1, 
Prop 2.8]) applied to the local ring R’ and the R’-module b’, it follows 
that 6’ is a free R’-module of rank one. This completes the proof of Propo- 
sition 1.4. 

oO 


We can use (1.4) to compute the degree of the isogeny E — ax E, as 
well as the degree of an endomorphism [a] : E > E. 


Corollary 1.5. Let E € ELL(RxK). 

(a) For all integral ideals a C Rx, the natural map E — @* E has de- 
gree NG a. 

(b) For all a € Rx, the endomorphism [a] : E — E defined in (1.1) has 
degree |N§ a. 

ProoF. Both parts are immediate from (1.4). For example, 


deg(E > a* E) = #E[a] from (1.4a) 
= Noa from (1.4b). 
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degla] = # ker[a] = #E[oRx] = NG (ARK) = |NGal. 
O 


Remark 1.6. Before going on to arithmetic questions, we want to make 
one brief remark about terminology. The classical name for the j-invariant 
of an elliptic curve with complex multiplication is a singular j-invariant. 
This terminology, meant to single out such j-invariants as being unusual, 
is somewhat unfortunate, since it suggests that the elliptic curve itself has 
singularities. We will not use the word “singular” in this sense, but the 
reader should be aware of this usage, since it is still fairly common. 

Notice that an elliptic curve defined over a finite field always has a 
“singular” j-invariant, since its endomorphism ring is always larger than Z 
[AEC, V.3.1]. In those rare cases that the endomorphism ring is a quater- 
nion algebra, the singularity is especially exceptional, which explains the 
origin of the term “supersingular” to describe such curves. 


§2. Rationality Questions 


In this section we will study the field of definition for complex multiplica- 
tion elliptic curves and their endomorphisms. We begin by showing that 
every elliptic curve with complex multiplication is defined over an algebraic 
extension of Q. 


Proposition 2.1. (a) Let E/C be an elliptic curve, and let a :C — C be 
any field automorphism of C. Then 


End(E’) © End(£). 


(b) Let E/C be an elliptic curve with complex multiplication by the ring 
of integers Rx of a quadratic imaginary field K. Then j(E) € Q. (Later 
we will show that j(E) is an algebraic integer. See (II §6) and (V.6.3).) 


(c) 


SLL(Ry) & {elliptic curves B/Q with End(E) © Rx} 
isomorphism over Q 


(Note that the original definition of ELL( Rx) is in terms of isomorphism 
classes of elliptic curves over C, not over Q.) 


Proor. (a) This is clear, since if 6: EF — E is an endomorphism of E£, 
then @° : E” — E® is an endomorphism of E”. 
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(b) Let o € Aut(C) be as in (a). Now E® is obtained from E£ by letting o 
act on the coefficients of a Weierstrass equation for E, and j(£) is a rational 
combination of those coefficients, so it is clear that 


j(E*) = j(E)’. 


On the other hand, (a) implies that End(£’) = Rx, so (1.2b) implies 
that EH? is in one of only finitely many C-isomorphism classes of elliptic 
curves. Since the isomorphism class of an elliptic curve is determined by 
its j-invariant [AEC III.1.4b], it follows that j(£)? takes on only finitely 
many values as o ranges over Aut(C). Therefore [Q(j(E)) : Q] is finite, 
so j(£) is an algebraic number. 

(c) For any subfield F' of C, let us momentarily denote by ELL (Rx) the 
i _ {elliptic curves E/F with End(£) = Rx} 


Tk 
ELL r(Rx) isomorphism over F’ 


If we fix an embedding Q Cc C, then there is a natural map 
Ee. ELLa(Rx) = ELLc(Rx). 


We need to show that this map is a bijection. 
Let E/C represent an element of ELL¢(Rx). Then we have: 
(i) j(Z) € Q, from (b); 
(ii) there is an elliptic curve E’/Q(j(E)) with j(£’) = j(E£), from 
[AEC III. 1.4]; 
(iii) E’ is isomorphic to E over C, from [AEC III.1.4b]. 


These three facts imply that ¢(£’) = EF, which proves that «€ is surjective. 

Next let E;/Q and E2/Q represent elements of €LL4(Rx), and sup- 
pose that e(£)) = e(£2). Then j(£1) = j(£2) from [AEC III.1.4b], and 
another application of [AEC III.1.4b] says that E, and EF, are isomorphic 
over Q. Hence £; and E represent the same element of ELLa(Rx), which 
shows that ¢ is also injective. Oo 


Next we study the effect that field automorphisms have on the maps 
[a] : E — E described in (1.1). In particular, we will find a field of defini- 
tion for these maps. Note that if ¢ is an endomorphism of EF and a is any 
automorphism of C, then ¢7 will be an endomorphism of E?. 


Theorem 2.2. (a) Let E/C be an elliptic curve with complex multiplica- 
tion by the ring R CC. Then 


[a] e” = [a7] zo for alla € R and allo € Aut(C), 


where the isomorphisms [-]~ : R —>+ End(E) and [-]go- : R —> End(E’) 
are normalized as in (1.1). 
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(b) Let E be an elliptic curve defined over a field L C C and with com- 
plex multiplication by the quadratic imaginary field K Cc C. Then every 
endomorphism of FE) is defined over the compositum LK. 

(c) Let E\/L and E2/L be elliptic curves defined over a field L C C. Then 
there is a finite extension L'/L such that every isogeny from EF to E is 
defined over L’. 


PROOF. Let w € Q- be a non-zero invariant differential on E. Then the 
normalization described in (1.1) says that 


[a]zw = aw for alla € R. 
Further, w® is an invariant differential on E7, so again from (1.1) we get 
(Blow? = Bw? for all BE R. 
Now for any a € R and any o € Aut(C), we compute 
([a]e7)"(w”) = (lalisw)” = (aw)? = 076 = [al pe (w”). 


Thus [a]~_° and [a”]z2 have the same effect on the invariant differential w. 
Now we use [AEC II.4.2c], which says that the natural map 


End(£”) — End(Q¢-), prov, 


is injective. (Note we are working in characteristic 0, so all finite maps are 
separable.) This proves that [a] _.” = [a7] po. 

(b) Let o € Aut(C) be an automorphism of C that fixes L. Since E is 
defined over L, we can take a Weierstrass equation for E with coefficients 
in L, so E° = EF. Then (a) says that for all a € R, 


lle” = [a7] go = [07 |p. 
If in addition o fixes K, then a7 = a. This proves that 


laln” = [ale for all o € Aut(C) such that o fixes LK. 


Hence the endomorphism [a] is defined over LK. 

(c) Asin (b), we take Weierstrass equations for FE, and E2 with coefficients 
in L. Let ¢ € Hom(£}, F2) be an isogeny. Then for any o € Aut(C) such 
that o fixes L, we have 6° € Hom(F, F2). Note that deg @7 = deg ¢. 
From [AEC III.4.11], we see that an isogeny ¢ € Hom(F}, E2) is determined 
by its kernel, at least up to an automorphism of £, and EF». Since £, has 
only finitely many subgroups of any given finite order, and since Aut(£1) 
and Aut(£2) are finite, it follows that Hom(£,, F2) contains only finitely 
many isogenies of a given degree. Therefore the set 


{¢” : ¢ € Aut(C), o fixes L} 
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is finite, which implies that @ is defined over a finite extension of L. Fi- 
nally, we observe from [AEC III.7.5] that Hom(£, £2) is a finitely gener- 
ated group, so it suffices to take a field of definition for some finite set of 
generators. O 


Remark 2.2.1. Notice that the proof of (2.1b) together with the estimate 
in (1.2b) shows that if End(£) = Rx, then 


[Q(s(Z)) : Q| < hx, 


where hy = #CL(Rx) is the class number of K’. We will prove later (4.3) 
that this is an equality. In particular, j(£) is in Q if and only if K has 
class number 1. For a complete list of these Q-rational j-invariants, see 
Appendix A §3. 


Remark 2.2.2. In view of (2.2.1), we see that if Rx has class number 1, 
then FE has a model defined over Q. We have already seen examples of this 
in (1.3.1) and (1.3.2), where we looked at curves with complex multiplica- 
tion by Z[i] and Z[p]. (Here p = e27*/3.) We can also illustrate (2.2) for 
these curves. For example, to normalize the curve 


E:y=se54+a, 
we use the isomorphism [-] : Z{] ~ End(£) determined by 
[}(x,y) = (—2, ty). 
To see that this is the correct normalization, we compute 


dz d(—2) _ 4 
y iy yo 


[i] 


If ao € Aut(C) is complex conjugation, then 


([él(z,y))” = (—2, ty)? = (—2”,i7y”) 
= (—27, —ty®) = [—-i](x7, y7) = [27] (27, y®). 


Hence [7]” equals [27], as it should by (2.2). 
Similarly, for the curve 


E:y=2°+4+1 
we take the isomorphism [-] : Z[p] — End(£) determined by 


[el(z,y) = (px, y). 
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Remark 2.2.3. There is an interesting converse to (2.1b). Suppose that 
A = Zw, + Zur is a lattice satisfying 


3 < [Q(w1/we) : Q) < 00; 


that is, w;/we is an algebraic number of degree at least 3 over Q. Then one 
can show that j(£,) is a transcendental number. Notice the analogy with 
the Gel’fond-Schneider theorem, which says that if a € Q with a # 0,1 
and if @ satisfies 2 < (Q(8) : Q| < oo, then a® is transcendental. The 
transcendence of j(E,) was first proven by Schneider. The interested reader 
will find a proof of this fact in Schneider [1, Thm. 17] or Waldschmidt [1, 
Cor. 3.2.4]. For a general account of the transcendence properties of elliptic 
and modular functions, see for example Waldschmidt [1, Ch. 3]. 


It is an immediate consequence of (1.4b) and (2.2b) that the torsion 
points of E generate abelian extensions of K(j(E)). Before giving the 
proof, we remind the reader of the analogous result for cyclotomic fields. 
Thus let ¢ € C* be a primitive N**-root of unity and let o € Gal(Q(¢)/Q). 
Then ¢? is another primitive N**-root of unity, say ¢7 = ¢°%), and it is 
an easy matter to check that the map 


p : Gal(Q(¢)/Q) — Aut(uw) & (Z/NZ)* 


is an injective homomorphism. (Here wy = C% is the group of N*t*-roots 
of unity.) Hence Q(¢)/Q is an abelian extension. We now prove the same 
thing for elliptic curves. (For another proof, see exercise 2.6.) 


Theorem 2.3. Let E/C be an elliptic curve with complex multiplication 
by the ring of integers Rx of the quadratic imaginary field K, and let 


L = K(j(E), Evors) 


be the field generated by the j-invariant of E and the coordinates of all 
of the torsion points of E. Then L is an abelian extension of K (j(E)). 
(N.B. In general, L will not be an abelian extension of K.) 


PRooF. To ease notation, let H = K(j(£)). Further let 
Lm = K(j(E), E[m)) = H(Elm)) 


be the extension of H generated by the m-torsion points of £. Since L is 
the compositum of all of the L,,’s, it suffices to show that L,, is an abelian 
extension of H. 

As usual, there is a representation 


p: Gal(K/H) — Aut(E[m]) 
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determined by the condition 
p(o)(T) =T? for all o € Gal(K/H) and all T € E[m]. 


(See [AEC III §7].) For an arbitrary elliptic curve, all we would be able to 
deduce from this is that Gal(L,,,/H) injects into the automorphism group 
of the abelian group E[m], so we would find that Gal(L,,/H) is isomorphic 
to a subgroup of GL2(Z/mZ). 

But the fact that our elliptic curve has complex multiplication gives us 
additional information. We take a model for E defined over H = K(j(E)), 
and then (2.2b) says that every endomorphism of £ is also defined over H. 
So elements of Gal(L,,/H) will commute with elements of Rx in their 
action on E[m|: 


({eJT)” = [al(T?) for allo € Gal(L,,/H), T € Elm], anda € Rx. 


In other words, p is actually a homomorphism from Gal(K/H) to the group 
of Rx /mRx-module automorphisms of E[m]. Hence p induces an injection 


¢ : Gal(Lm/H) — Autre, jmrx (E[m]). 


Now we use (1.4b), which says that E[m] is a free Rx /mR«-module 
of rank one. This implies that 


Aut rx /mrx (Elm) = (Rx /mRx)" , 
and hence Gal(L,,/H) is abelian. Oo 


Before proceeding with the general theory, we will pause to construct a 
few more examples of elliptic curves having complex multiplication. More 
precisely, we will find all elliptic curves that possess an endomorphism of 
degree 2. We already know one such curve, namely the curve y? = 2? +2 
with complex multiplication by Z|i], since the map [1 + 7%] has degree 2. 
From (1.5b), we need to find all quadratic imaginary fields K that have an 
element a € Rx satisfying |N§a| = 2. This is an easy exercise (which we 
leave to the reader), the answer being that there are three such fields: 


K=Q(vV-1), Rx =Z[Vv-1], a=1+V-1; 
K=Q(vV-2), Rx =Z[V-2], a= /—2; 
K=0@/—%), Ry = 2 |"), gee 


Since all three of these rings Rx have class number 1, we know from (2.2.1) 
that the corresponding elliptic curves have j-invariants in Q. 
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How can we find equations for these curves and their endomorphisms? 
It is possible to proceed analytically, but we will take another approach. 
If ¢: E — E has degree 2, then its kernel E[¢] consists of two points, O 
and a point of order 2. If we move the point of order 2 to (0,0), then F 
will have a Weierstrass equation of the form 


E:y? =a? + ax? + ba. 


For this elliptic curve E, we have already determined an elliptic curve E’ 
and an isogeny ¢: E — E’ whose kernel is {O, (0,0)}, namely 


E!: Y¥? = X%_2aX?4(a?—4b)X, (x,y) = (erat zu (1 _ =)). 


(See [AEC ITII.4.5], although note we have taken the negative of the isogeny 
defined there and have substituted x? + ax? + br for y*.) Hence F will 
possess an endomorphism of degree 2 if and only if this E’ is isomorphic 
to E. 

To see when F£ and E’ are isomorphic, we set their j-invariants to be 
equal and solve for a and b, or more precisely for the ratio a?/b. Now 


pn _ 206(a? — 3b)3 ~an _. 16(a? + 12b)8 
al a) a Ce eT ORY 1 


Setting j(£) = j(E’), we find after some calculation that 
16b(a? — 4b)a?(a? — 8b)(16a4 — 81a7b + 32467) = 0. 


The first two cases, b = 0 and a? — 4b = 0, give singular curves, so 
we discard them. The third case, a = 0, gives the curve y® = x° + bz 
with j(E) = 1728 and complex multiplication by Z[i]. 

Next consider the case a? — 8b = 0. Taking b = 2 and a = 4 gives the 
curve 

E:y? =2° 4+ 42? + 22 


with j(E) = 8000 = 2°53. Similarly, E’ is given by the equation 
E’: Y? = X? —8X7 +8X 


with j-invariant j(£") = 8000. Hence £ and E’ are isomorphic, and we 
easily find an isomorphism 


iy 


E' — E, (X,Y)r> (-5 


-3 737) 
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Composing the isogeny & — E’ with this isomorphism, we obtain the 
desired endomorphism of F of degree 2: 


V2 
E:y?=234+ 40? 4 2x Rez E 
1 2 y 2 
ee Bae foc Ua Le pl), 
a) (9 (#4443) pe -)) 


There remains the case 16a* — 81a*b + 324b*, which (taking a = 36) 
leads to the elliptic curves 


E:y’ =2° +3627 + 18 (9+ 5V-7) a, 
E’:Y? = X? —72X? +.72(9-5V-7) X 


with j(£) = j(E’) = —3375 = —335°. We will leave it to the reader to 
make the appropriate variable changes which lead to models for EF’ and E’ 
over Q and to an explicit formula for the corresponding endomorphism 
of degree 2. The final answer is given in the following summary of our 
calculations. 


Proposition 2.3.1. There are exactly three isomorphism classes of el- 
liptic curves over C which possess an endomorphism of degree 2. The 
following are representatives for these curves and endomorphisms. 


(i) E:y=25+2, j = 1728, a=1+VJ/-1, 
alte.» = (a? (242) .an%y (1-5 )); 
(ii) E:y?=234+42?+22, 4 = 8000, a= vV-2, 
fal(x,y) = (0 (« +44 :) Ja fy (1 - =)) : 
Oy Sepa as Scere eet 


D 
la](z,y) = Ge (« ~ fists) a 3y (1 + tS"55)) 


We now resume our development of the general theory of complex 
multiplication. From here on we will use (2.1) to identify ELL(Rx) with 
the Q-isomorphism classes of elliptic curves having complex multiplication 
by Rx. Then there is a natural action of Gal(K/K) on ELL(RxK) defined 
by the property that o € Gal(K/K) sends the isomorphism class of E to 
the isomorphism class of £7. On the other hand, (1.2b) says that the action 
of the class group CL(Rx) on ELL(Rx) is simply transitive, so there is 
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a unique a € CL(Rx), depending on o, such that @* EF = E?’. In other 
words, there is a well defined map 


F : Gal(K/K) — C£(Rx) 
characterized by the property 
E°=F(o)*E for allo € Gal(K/K). 


It is by studying this map F that we will be able to precisely describe 
the field K (j(E)). An easy property of F’, which we will prove below, is 
that F is a homomorphism. A much deeper property, which we will also 
prove, is that F' is independent of the choice of the curve E € ELL(Rx). 
The astute reader will have noticed that F is actually well defined on the 
larger group Gal(Q/Q). However, it is only on the smaller group Gal(K/K) 
that F will be independent of E. 

Before proving these basic properties about F’, we want to stress that 
the definition of F has an essential analytic component, since F(a) depends 
on the way in which the lattice of an elliptic curves changes when the lattice 
is multiplied by an ideal. Thus if we denote by j(A) the j-invariant of 
the elliptic curve E,, then as described in Chapter I, j(A) is an analytic 
function of A. The map F is then characterized by the formula 


(A)? = j(F(e)*A), 
so F converts the algebraic action of o into the analytic action of multipli- 
cation by F(a)7!. 


Proposition 2.4. Let K/Q be a quadratic imaginary field. There exists 


a homomorphism 7 
F: Gal(k/K) — C£L(Rx) 


uniquely characterized by the condition 


E° = F(o)*E for allo € Gal(K/K) and all E € EL£(Rx). 


Proor. As described above, (2.1) and (1.2b) ensure that for any element 
o € Gal(K/K) and any E € €£LL(Rx), there is a unique @ € CL(Rx) with 
E?’ =a E. So for a fixed E, we get a well-defined map 


F : Gal(K/K) > C£L(Rx) 


determined by the property E° = F(c) * E for all o € Gal(K/K). It is 
easy to check that F' is a homomorphism, since 
Flor)*E=E =(E')? = (F(7) * E)’ 
= F(o) * (F(r) * E) = (F(o)F(r)) * E. 
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(Note that Gal(K/K) acts on the left.) 

It remains to show that the definition of F' is independent of the choice 
of a particular elliptic curve in ELL(RxK). So let Fi,Eo € ELL(Rx), 
let o € Gal(K/K), and write EZ = 4, * E; and EJ = G2 * Ey. We need to 
show that a; = dy. Since CL(Rx) acts transitively on ELL(Rx), we can 
find some 6 with E = 6 * Ey. Then 


(6 * E,)?” = ES = a2 * Eg = a2 * (6 * E)) = (agba; ') * EY. 


So if we can prove that (6 * E,)? is equal to 6 * EZ, then we can cancel 6 
from both sides to conclude that EZ = (aaj!) x EZ; and then (1.2(iii)) 
will give G; = G2. Hence the following proposition completes the proof of 
Proposition 2.4. (Note that b” = 6, since b C K and o € Gal(K/K).) 


Proposition 2.5. Let E/Q be an elliptic curve representing an element 
of ELL(Rx), let a € CL(Rx), and let o € Gal(Q/Q). Then 


(@* E)? =a? * EY. 


Although the statement of Proposition 2.5 looks relatively innocuous, 
it is giving a relationship between the algebraic action of o and the ana- 
lytic action of multiplication by a. This suggests that the proof may not 
be entirely straightforward. The main idea is to find an algebraic descrip- 
tion of d* E. One of the tools we will need is the following lemma from 
commutative algebra, whose proof we leave as an exercise. 


Lemma 2.5.1. Let R be a Dedekind domain, let a be a fractional ideal 
of R, and let M be a torsion-free R-module. Then the natural map 


¢@: a tM — + Homa(a, M) 


x ro (d,:a az) 
is an isomorphism. 


PROOF (of Proposition 2.5). Choose a lattice A so that E & Eq. Also fix 
a resolution (i.e., an exact sequence) 


REA Re 3 oO (i) 


where A is an m x n matrix with coefficients in Rx. The idea underlying 
the proof of Proposition 2.5 is that we should have 


C/a“'A ax E = Hom(a, E), 


where we want to describe Hom(a, £) as an algebraic variety and not just 
as an Rx-module. (Here and in the following, Hom means homomorphisms 
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of Rx-modules.) We begin by applying Hom to the “product” of the exact 
sequence (i) and the exact sequence of Rx-modules 


0— A—C— E — 0. (ii) 


This gives us the following commutative diagram: 


0 0 0 
0 — > Hom(a,A) — Hom(a,C) —> Hom(a, £) 

| | (iii) 
0 — > Hom(R%,A) —> Hom(R%,C) —> Hom(R%, £) 

[A A |4 


0 —> Hom(RF,A) —> Hom(R¥,C) —> Hom(R?, £) 


For any Rx-module M, we have Hom(R},,M) = M”, and applying 
Lemma 2.5.1, first with M = A and then with M = C, we get 


Hom(a,A)=a7'A and  Hom(a,C)=a !C=C. 


Using these isomorphisms, we can rewrite the diagram (iii) as 


i (iv) 


0 — A” — Cc” —-, tha — 0 
[u ‘4 t4 
0 — A™ —> CC” — E™ — 0 


Here ‘A is the transpose of the matrix A, and the bottom two rows are 
clearly exact on the right, since they are just a number of copies of the 
exact sequence (ii). 

Applying the snake lemma to the bottom two rows of (iv) gives the 
exact sequence 


0 a-1A —+ C — (ker BE” 4 E™) — A/*AA™. (v) 
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Notice that E” 2 E™ is an algebraic map of algebraic varieties, since ‘A is 
an m xX n matrix whose coefficients are elements of End(£) = Rx. Hence 
the inverse image of the point (0,0,...,0) € E” is an algebraic subvariety 
of E”. Of course, E” and E™ are group varieties, so what we are saying is 


that the kernel of £” A E™ is an algebraic group variety. Further, (2.2a) 
says that for any o € Aut(C), the corresponding map from E?” > E7™ 
is obtained by applying o to the entries of ‘A, treating those entries as 
elements of Rx CC. 

On the other hand, looking at the complex topology for one more mo- 
ment, we note that A”/‘4A”™ is discrete and C/a~'A is connected. Hence 
the exact sequence (v) gives 


(a * E)(C) = C/a7'A & identity component. of ker(E” ie E™), 


We have thus described a * FE algebraically in terms of the algebraic 


map E£” a E™, and it now easy to finish the proof of Proposition 2.5. 
For any o € Gal(Q/Q), we apply our characterization first to E and then 
to E° to deduce that 


(ax E)? = (identity component of ker(E” 4 E™)) 


= identity component of ker ((E)" a (E7)") 
=a’ x E. 


This completes the proof of Proposition 2.5, and with it the proof of Propo- 
sition 2.4. 
O 


§3. Class Field Theory — A Brief Review 


Class field theory describes the abelian extensions of a number field K 
in terms of the arithmetic of K. The theory of complex multiplication 
provides an analytic realization of class field theory for quadratic imaginary 
fields, much as cyclotomic theory gives a realization of class field theory 
for Q. In this section we will briefly review, without proof, the basic facts 
from class field theory which will be used in the sequel. We will begin with 
the classical version using ideals and ideal class groups. Afterwards we will 
present the more modern idelic version. For proofs of the theorems stated 
in this section and for additional material on (global) class field theory, the 
reader might consult Lang [5], Tate [7], or Neukirch [1]. We will mostly 
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restrict attention to totally imaginary fields, that is, fields with no real 
embeddings, since (except for §7) that is the only case we will use in the 
sequel. 

Let K be a totally imaginary number field and let L be a finite abelian 
extension of K; that is, L/K is Galois with abelian Galois group. As usual, 
we write Rx and Ry for the rings of integers of K and L respectively. Let p 
be a prime of K which does not ramify in L, and let $ be a prime of L 
lying over p. Thus the picture is 


L BY Ri /P 
finite abelian unramified extension of 
extension prime finite fields 

K p Rx/p 


By restriction, we get a homomorphism from the decomposition group of PB 
to the Galois group of the residue fields, 


{o € Gal(L/K) : B7 =P} — Gas ae cae : 


The right-hand Galois group is cyclic, generated by the Frobenius auto- 
morphism 
zr Nor. 

Further, since p is unramified, there is a unique element o, € Gal(L/K) 
which maps to Frobenius. Our notation reflects the fact that o, is deter- 
mined by the prime ideal p in K. For a general Galois extension L/K, p 
will only determine the conjugacy class of op, and making a new choice 
for $% will change o, by conjugation. But in our situation o, will not 
change, since we have assumed that L/K is abelian. Thus op € Gal(L/K) 
is uniquely determined by the condition 


dp(z) = aNéP (mod $B) for allx € Ry. 
Let ¢ be an integral ideal of K that is divisible by all primes that 
ramify in L/K, and let 
I(c) = group of fractional ideals of K which are relatively prime to c. 


Then the Artin map is defined using the oy’s and linearity: 
(., L/K) : I(e) — Gal(L/K), 
(a, L/K) = (1 rs 2/6) = [lo 
p p 


Notice that the Artin map is defined by piecing together local information, 
one prime at a time. The following theorem, which is a weak version of 
Artin’s reciprocity law, provides important global information. 
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Proposition 3.1. (Artin Reciprocity) Let L/K be a finite abelian ex- 
tension of number fields. There exists an integral ideal c C Rx, divisible 
by precisely the primes of K that ramify in L, such that 


((a),L/K)=1 for alla € K* satisfying a = 1(mod ¢). 


If (3.1) is true for the ideals ¢, and cg, then it also true for ¢; + co. 
There is thus a largest ideal for which (3.1) is true. We call this ideal the 
conductor of L/K and denote it by ¢z/x. 

In view of (3.1), it is natural to define the group of principal ideals 
congruent to 1 modulo c: 


P(c) = {(a) : a€ K*, a=1(mod c)}. 


Artin reciprocity says that the kernel of the Artin map contains P(c) for 
an appropriate choice of c. More precisely, 


ae P(cysK) => (a, L/K) = 1, 


It is important to observe that a principal ideal (a) may be in P(c) even 
if a #1 (mod c); all that is necessary is that there exist a unit € € R% such 
that €a = 1(mod c). 

Let p be a prime of K which is unramified in L. Then p splits com- 
pletely in L if and only if the extension of residue fields has degree 1, or 
equivalently if and only if (p, L/K) = 1. Thus the unramified prime ideals 
in the kernel of the Artin map are precisely the primes of K that split 
completely in L. 


Definition. Let c be an integral ideal of K. A ray class field of K (mod- 
ulo c) is a finite abelian extension K,/K with the property that for any 
finite abelian extension L/K, 


ch/K|¢ =>LIc K,. 


Intuitively, one can think of the ray class field as the “largest” field 
with a given conductor. However, it is important to note that the conductor 
of K, need not actually equal c. For example, the ray class field of Q(i) 
modulo the ideal (2) is just Q(z) itself, so Q(i)(2) has conductor (1). 


Theorem 3.2. (Class Field Theory) Let L/K be a finite abelian exten- 
sion of number fields, and let ¢ be an integral ideal of K. 
(a) The Artin map 

(.,L/K) : T(ezyx) — Gal(L/K) 


is a surjective homomorphism. 
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(b) The kernel of the Artin map is (NkIL)P(cr/K), where I, is the group 
of non-zero fractional ideals of L. 

(c) There exists a unique ray class field K, of K (moduloc). The conductor 
of K,/K divides c. 

(d) The ray class field K, is characterized by the property that it is an 
abelian extension of K and satisfies 


{ primes of K that 


split completely in i =a epeine napals EEK) fe 


Example 3.3. Consider the ray class field of K modulo the unit ideal ¢ = 
(1). It is the maximal abelian extension of K which is unramified at all 
primes. We call K,(1) the Hilbert class field of K and denote it by H or Hx. 
Notice that 


I(¢yK) = 1((1)) = {all non-zero fractional ideals of K}, 
P(cu/K) = P((1)) = {all non-zero principal ideals of K}, 


so the Artin map induces an isomorphism between the ideal class group 
of K and the Galois group of the Hilbert class field of K: 


(.,H/K) : CL(Re) — Gal(H/K). 


We will also need the following version of Dirichlet’s theorem on primes 
in arithmetic progressions. 


Theorem 3.4. Let K be a number field and ¢ an integral ideal of K. 
Then every ideal class in I(c)/P(c) contains infinitely many degree 1 primes 
of K. 


The Idelic Formulation of Class Field Theory 


We will now briefly recall how class field theory is formulated using ideles. 
This material will not be used until §7, so the reader may wish to omit the 
rest of this section until arriving at that point. 

Let K be a number field, and for each absolute value v on K, let Ky 
be the completion of K at v. Further, let R, be the ring of integers of K, 
if vy is non-archimedean, and let R, = K, otherwise. The idele group of K 


is the group 
/ 
x =[[ 
Vv 


where prime indicates that the product is restricted relative to the R,’s. 
This means that an element s € [] K% in the unrestricted product is in Ax 
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if and only if x, € R* for all but finitely many v. In particular, we can 
embed K™* into Aj, by using the natural diagonal embedding 


K— Ax, a+— (---,a,a,a,--°), 


since any a € K™* is in R* for all but finitely many K. Similarly, for any 
given v we embed Kj as a subgroup of AX via 


Kx — Ax, tr (...,1,1,¢,1,1,...). 
T 


v-component 


If v is a non-archimedean absolute value corresponding to a prime 
ideal p, we will often write K, and Ry, in place of K, and R,. We will also 
write ord, for the corresponding normalized valuation. 

Let s € AX be an idele. We define the ideal of s to be the fractional 


ideal of K given by 
(s) =. I[»"” #e 
p 


where the product is over all prime ideal of kK. Note that (s) is well defined, 
since s, is a p-adic unit for all but finitely many p. 

One makes Aj, into a topological group in the usual way; we will not 
need the precise definition of the topology. For any integral ideal c of K, 
let U, be the subgroup of Aj, defined by 


U. = {s € Ax : 5) € R$ and s,=1(modcR,) for all primes p}. 


Then U, is an open subgroup of Aj, and one proves that K*U, is a subgroup 
of finite index in Aj. 
If L/K is a finite extension, then there is a natural norm map from Aj 
to Aj,. This is a continuous homomorphism 
Nz >A; — Ax 
defined by the prescription that the v-component of Nz is 


Lw 
[Ncw 


wiv 


The idelic formulation of class field theory is given in terms of the 
reciprocity map described in the following theorem. 


Theorem 3.5. Let K be a number field, and let K?> be the maximal 
abelian extension of K. There exists a unique continuous homomorphism 


Ax, —> Gal(K?"/K), s+— [s, K], 
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with the following property: 


Let L/K be a finite abelian extension, and let s € Aj, 
be an idele whose ideal (s) is not divisible by any primes 
that ramify in L. Then 


[s, K]], = ((s), L/K). 


Here (-,L/K) is the Artin map, and Gal(K?>/K) is given the usual pro- 
finite topology. The homomorphism [-,K] is called the reciprocity map 
for K. 

The reciprocity map has the following additional properties: 
(a) The reciprocity map is surjective, and K* is contained in its kernel. 
(b) The reciprocity map is compatible with the norm map, 


El ac = [Nkz, K] for all x € Aj. 


(c) Let p be a prime ideal of K, let tig Cc Gal(K?>/K) be the inertia 
group of p for the extension K?>/K, let Ty © Ky be a uniformizer at p, 
and let L/K be any abelian extension that is unramified at p. Then 


(tp, K]|, = (p, L/K) = Frobenius for L/K at p, 
and 
(Re re 


There is, of course, much more to class field theory that we have not 
mentioned. For example, one often wants to know the exact kernel of 
the reciprocity map and the correspondence between subgroups of A}, and 
subfields of K#>. However, the only additional fact that we will need in 
this chapter is the following idelic characterization of ray class fields. 


Theorem 3.6. Let K be a number field, let c be an integral ideal of K, 
let K, be the ray class field of K modulo c, and let U, be the subgroup 
of Aj, described above. Then the reciprocity map induces an isomorphism 


[-, K] : At /K*U, > Gal(K,/K). 


In other words, |s, K]| acts trivially on the ray class field K, if and only if s 
can be written as s = au with a € K* and u € U,. 
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§4. The Hilbert Class Field 


Our goal in this section is to prove the following theorem. 


Theorem 4.1. Let K/Q be a quadratic imaginary field with ring of 
integers Rx, and let E/C be an elliptic curve with End(£) = Rx. Then 
K (j(E)) is the Hilbert class field H of K. 


Remark 4.1.1. Note that it is easy to produce an elliptic curve with 
endomorphism ring equal to Rx. For example, we could take FE to be the 
curve corresponding to the lattice Rx. Then 


: : g2(Rx)* 
E) = 9(Rx) = 17283— 

I) 92(Rx)? — 2793(Rx)? 

is given in terms of series go(Rx) and g3(Rx) involving the elements of Rx. 

Alternatively, if we write Ry = Zr + Z, then 


a ; 1 ‘a TENT 
j(E) = (RK) = sae t Dee, 


n=0 


where the c(n) € Z are the coefficients in the g-series expansion of j (I.7.4b). 
So Theorem 4.1 says that the Hilbert class field of a quadratic imaginary 
field K is generated by the value of a certain holomorphic function j(7T) 
evaluated at a generator for the ring of integers of K. 


We will actually prove much more than the mere statement of The- 
orem 4.1. We will give an explicit description of how the Galois group 
of H/K acts on j(£). To do this, we recall the homomorphism 


FP: Gal(K/K) oar CL(Rx) 
from §2 characterized by the condition 
Ev’ =F(o)*E for all o € Gal(K/K) and all E € ELL (Rx). 


Note that the kernel of F is actually a finite quotient of Gal(K/K), since 
any E will be defined over some finite extension L/K, and then F(a) = 1 
for o € Gal(K/L). Since CL(Rx) is an abelian group, F factors through 


F : Gal(K?>/K) —> C£L(Rx), 


where K* is the maximal abelian extension of K. Recall also the Frobenius 
element o, € Gal(K*>/K) corresponding to a prime p in K. The following 
proposition, together with basic class field theory, will serve to completely 
determine F. 
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Proposition 4.2. There is a finite set of rational primes S C Z such that 
if p ¢ S is a prime which splits in K, say as pRx = pp’, then 


F(op) =p € CL(Rx). 


Proposition 4.2 does not look very strong, since it determines F’ on 
fewer than half of all Frobenius elements. But we will be able to use it 
to get complete information about F’. Before proceeding with the proof of 
Proposition 4.2, we will derive some of its consequences, including a proof 
of Theorem 4.1. 


Theorem 4.3. Let E be an elliptic curve representing an isomorphism 
class in ELL( Rx). 
(a) K(j(£)) is the Hilbert class field H of K. 


(b) [OG()) : Q] = [KG(B)) : K] = he, 
where hy = #CL(Re) = #Gal(H/K) is the class number of K. 
(c) Let Fy,...,E, be a complete set of representatives for ELL (Rx). 
Then j(£),...,j(Ep) is a complete set of Gal(K /K) conjugates for j(E). 
(d) For every prime ideal p of K, 

i(E)* = G+). 
More generally, for every non-zero fractional ideal a of K, 


j(B)OH/) = j(a* E). 


Remark 4.3.1. It is now clear why we took the inverse when we defined 
the action of an ideal class a on an elliptic curve E,. If we had used the 
more natural definition a* BE, = Eaa, then the action of the Artin symbol 
on j(E£) in (4.3d) would instead have been j(E)(*"/*) = j(a-!* E). Thus 
we put the inverse into the action of CL(Rx) on ELL(Rx) so that the 
Artin symbol would act without an inverse. 


Proor (of Theorem 4.3). Let L/K be the finite extension corresponding to 
the homomorphism F’: Gal(K/K) - CL(Rx), by which we mean that L 
is the fixed field of the kernel of F’. Then 
Gal(K/L) = ker F 

= {0 € Gal(K/K) : F(c) = 1} 

= {0 € Gal(K/K) : F(o)* E=E} since by (1.2), CL(Rx) 

acts simply transitively on ELL(RK) 

= {0 € Gal(K/K) : E7 = E} from the definition of F 

= {0 € Gal(K/K) : j(E°) = j(E)} 

= {0 € Gal(K/K) : j(E)’ =35(E)} 

= Gal(K/K(j(E))). 
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Hence L = K(j(£)). Further, since F maps Gal(L/K) injectively into 
CL(RK), we see that L/K is an abelian extension. So we have shown 
that L = K(j(£)) is an abelian extension of K. 

Let ¢p;% be the conductor of L/K, and consider the composition of 
the Artin map with F, 


(..L/K) 


F 
T(crK) Gri«k ——> CL(Rx). 


We claim that this composition is just the natural projection of I(¢y/x) 
onto C£L(Rx). In other words, we wish to establish the 


Claim: F((a,L/K)) =a for all a € I(cy/x)- 

Let a € I(cz/x), and let S be the finite set of primes described in (4.2). 
From Dirichlet’s theorem (3.4) there exists a degree 1 prime p € I(cz/K) 
in the same P(cy/x )-ideal class as a and not lying over a prime in S. In 
other words, there is an a € K™ satisfying 


a =1(mod cz/K) and a= (a)p. 


We compute 


F((a,L/K)) = F(((a)p, L/K)) since a = (a)p 
((p, L/K)) since a = 1 (mod c;/x) 
from (4.2), since NGp ¢ S 


since a = (a)p. 


oll 
Al Sl hy 


This completes the proof of the claim. 
Notice that as an immediate consequence we find that 


F(((a),L/K)) =1 for all principal ideals (a) € I(¢z/x), 


and not just for those that are congruent to 1 modulo ¢z/%. We also know 
that the map F': Gal(L/K) — C£(Rx) is injective, so this implies that 


((a),L/K) =1 for all (a) € I(cz/K). 


But the conductor of L/K is the smallest integral ideal ¢ with the property 
that 
a = 1 (mod c) => ((a),L/K) =1. 


(See §3.) It follows that ¢z;~% = (1). The conductor is divisible by every 
prime that ramifies (3.1), from which we conclude that the extension L/K 


is everywhere unramified. Therefore L is contained in the Hilbert class 
field H of K. 
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On the other hand, the natural map I(¢z/;%) = I((1)) + C£(Rx) is 
clearly surjective, so the claim implies that F : Gal(L/K) — C£L(Rx) is 
surjective, hence an isomorphism. Therefore 


[L : K] = #Gal(L/K) = # CL(Rx) = #Gal(H/K) =[H: K]. 


This combined with the inclusion L Cc H proves that L = H. Since L = 
K Gj (E)), this completes the proof of (a), as well as the second equality 
in (b). 

To prove the first equality in (b), we use the observation (2.2.1) that 


[Q(j(E)) : Q] < he. 


This inequality combined with [K(j(E)) : K] = hx and [K : Q| = 
implies that [Q(j(Z)) : Q] = hx, which completes the proof of (b). 

Next, from (1.2b) we know that C£L(Rx) acts transitively on the set 
of j-invariants 


d= {j(F1),---,5(En)}, 


since by [AEC III.1.4b] the set ELL(Rx) may be identified with the j- 
invariants of its elements. The map F : Gal(K/K) — C£L(Rx) is defined 
by identifying the action of Gal(K/K) on g with the action of CL(Rx) 
on g, so Gal(K/K) also acts transitively on J. Therefore J is a complete 
set of Gal(K/K) conjugates of j(E), which proves (c). 

Finally, we see that the claim proven above gives (d) for all ideals 
in I(cz/K). But ¢p;~K = (1), so I(cz/x) is the set of all non-zero fractional 
ideals of K. Oo 


It remains to prove Proposition 4.2. For that purpose we will need the 
following result which says that isogenies behave nicely under reduction. 


Proposition 4.4. Let L be a number field, 8 a maximal ideal of L, E,/L 
and E2/L elliptic curves with good reduction at P, and E, and E2 their 
reductions modulo $8. Then the natural reduction map 


Hom(F,, Ez) =e Hom(£1, Ea), o) >. d, 
is injective. Further, it preserves degrees, 
deg(@) = deg(¢). 
PROOF. Since the degree of a non-zero isogeny is non-zero, the injectivity 


follows from the preservation of the degree. However, the proof of injectiv- 
ity is more elementary, so we will give a separate proof. 
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Let ¢: E, > E> be an isogeny satisfying ¢ = [0]. For any integer m 
prime to 8, [AEC VII.3.1b] says that E2[m] injects into Ey. On the other 
hand, if T € E,|m], then by assumption 


WT) = 6(f) = 0. 


Since ¢(T) € E2[m], it follows that ¢(T) = O. Therefore E;[m] C ker(@). 
This holds for arbitrarily large m, so we must have ¢ = [0]. 

Now we begin the proof that deg(¢) = deg(#). Choose a rational 
prime @ relatively prime to 8. Our idea is to use the Weil pairing and 
calculate everything on the Tate modules. (See [AEC III.8.3] for the prop- 
erties of the Weil pairing eg : Te(E) x Te(E) — Te(ye) that we will need.) 
For any z,y € Ty(F1) we have 


CE; (a, y) ie? = €F, ((deg Q)z, y) = e€F, (doz, y) = €E, (ox, ey), (i) 


and a similar calculation on E} gives 
9 (z jie F; ( Ag by) (ii) 
ep (LY CF, (PL, dy). 


Next we observe that if F/L is any elliptic curve with good reduc- 
tion at $8, then T;(E) = T)(E). This crucial equality is a consequence 
of [AEC VIL.3.1b], which says that E[¢"] & E[é"] for all n. Looking at the 
definition of the Weil pairing [AEC III §8], we see that 


en(@,y) = ep (2,9) for all a, y € Te(E). (iii) 


We now take x,y € T;(£1) and compute 


CoC ae 
=en,($2,dy) from (i) 
= ep, (02, oy) from (iii) 
= ep, (G2, O79) 
= ep (€,9)%%* from (ii). 


This equality holds for all x,y € T(E), hence for all Z,y € Te(E1). The 
non-degeneracy of the Weil pairing on T;(£,) now implies that deg¢ = 
deg ¢. oO 


PRooF (of Proposition 4.2). We know that ELL( Rx) is finite from (1.2b) 
and that every curve in ELL(Rx) can be defined over Q from (2.1c), so 
we can choose a finite extension field L/K and representatives E),..., En 
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defined over L for the distinct K isomorphism classes in €£.£(R). Further, 
using (2.2c) we may replace L by a finite extension so that every isogeny 
connecting every pair of £;’s is defined over L. We now let S be the finite 
set of rational primes satisfying any one of the following three conditions: 


(i) p ramifies in L, 
(ii) some FE; has bad reduction at some prime of L lying over p, 
(iii) p divides either the numerator or the denominator of one of the 
numbers NG(j(Ei) — j(Ex)) for some i # k. 


Notice that condition (iii) means that if p ¢ S and if is a prime of L 
dividing p, then FE; # EF; (mod 9B), since their j-invariants are not the same 
modulo . 

Now let p ¢ S be a prime which splits as pRx = pp’ in K, and let B 
be a prime of L lying over p. Also let A be a lattice for E, so E(C) = C/A. 
Choose some integral ideal a C Rx relatively prime to p such that ap is 
principal, say 

ap = (a). 

From [AEC VI.4.1b] there are isogenies connecting FE, p* E, and axp* EF 
corresponding to the natural analytic maps as indicated in the following 
diagram: 


ChA = Ce tA = Car. = CHE SSC 


[e It le |e 
i = > se — quaged =. Gk =k 


~N 


Next we choose a Weierstrass equation for E/L which is minimal 
at $B (see [AEC VII §§1,2]) and let 


dz 
w= ———___——_ 
2y + a,x + ag 
be the associated invariant differential on E. The pull-back of w to C/A will 
be some multiple of dz. Since the map along the top row of our diagram is 


simply z — az, we see that dz pulls back to d(az) = adz. Tracing around 
the commutative diagram, we conclude that 


(Acwo )*w = aw. 


As usual, we will use a tilde to denote reduction modulo $. Since the 
equation for E/L is minimal at %, we obtain an equation for EF by reducing 
the coefficients modulo $8, and so the reduced differential 


dx 


oo 5s Sa 
2y + a,x + a3 
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is a non-zero invariant differential on E. Further, since (a) = ap and 
since $B divides p, we find 


(Lopod)*e =(A0p06)*w = 40 =0. 
It follows from [AEC II.4.2c] that 
ow ¢ is inseparable. 
On the other hand, using (4.4) and (1.5a), we see that 
deg 6 = deg $ = NOP =P, 
deg b = deg = N§a, 
deg A = degA = 1. 


Since NG a is prime to p by assumption, both w and \ are separable, so we 
conclude that wees 

¢:E—p*E 
must be inseparable. Now any map (such as d) factors as a g'*-power 
Frobenius map followed by a separable map [AEC H.2.12], so the fact 
that @ has degree p and is inseparable implies that ¢ must “be” the p*- 
power Frobenius map. More precisely, there is an isomorphism from E®) 
to p+E so that the composition 


th 
~ Pp -power ae ~ a-ak 
Bo SE ae 
Frobenius 


equals @. 
In particular, we find that 


j (B+ E) =5 (EB) = 5(B, 
so 
j(B¥E) = j(B)? = j(B)NC? = j(B)"" = j(B%) = j(F(op)*E) (mod ). 
But from the original choice of excluded primes S, we have 
J(E;) = j(Ex) (mod $8) ifand only if FE; = Ex. 


Hence px E = F(o,) * E, and the simplicity of the action of CL(Rx) 
on ELL(Rx) (1.2b) gives the desired conclusion 


F(op) =p. 
O 
We also record for later use the following fact which we proved during 
the course of proving Proposition 4.2. 


128 II. Complex Multiplication 


Lemma 4.5. Let E be an elliptic curve with complex multiplication 
by Rx, and suppose that E is defined over a number field L. Then for all 
but finitely many degree 1 primes p of K, the natural map 


E—p*k 
has degree p, and its reduction 
E— +k 


is purely inseparable. (Here we reduce modulo some prime $8 of L lying 
above p.) 
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Let & be an elliptic curve with complex multiplication by a quadratic imag- 
inary field K. In this section we are going to describe the field generated 
by the points in E(C)tors, much as we described the field generated by j(F£) 
in the last section. Our goal is to use the torsion points of FE to generate 
abelian extensions of K. 

Before beginning, we briefly recall the analogous (but simpler) case of 
cyclotomic extensions. In this case the elliptic curve E(C) is replaced by 
the multiplicative group G,,(C) = C*. Let 


pn = ker (Gm(C") ane Gn(C’)) 


be the group of N-torsion points of G,, as usual; that is, wx is the group 
of N*-roots of unity. As is well known, the extension Q(j.)/Q is an 
abelian extension that is ramified only at primes dividing N. Let p be a 
prime with p{N, choose a generator ¢ for yey, and let a, € Gal(Q(¢)/Q) 
be the Frobenius element associated to p. Also let % be a prime of Q(C) 
lying above p. Then by the definition of o, we have 


cvr = CP (mod §f). 


But 1,¢,¢?,...,¢%~! are distinct modulo 8, since our assumption B { N 
implies that X% — 1 is separable in characteristic p. Hence the congruence 
is an equality, 

re, 


and so we conclude that 


Op =1 = p=1(modN). 
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Therefore Q(¢) = Q(fen) is the ray class field of Q of conductor N. (Ac- 
tually, it is the ray class field of conductor Noo. Since the base field Q is 
not totally imaginary, we have to also consider ramification of the infinite 
place.) 

Now let L/Q be any abelian extension, and let N be the conductor 
of L. Then class field theory (3.2) says that L is contained in the ray class 
field of conductor N, so we recover the following famous result. 


Theorem 5.1. (Kronecker-Weber Theorem) Every abelian extension of 
Q is contained in a cyclotomic extension; that is, given any finite abelian 
extension L/Q, there is a root of unity ¢ such that L Cc Q(¢). 


Thus the ray class fields of Q are generated by the values of the analytic 
function 


e2tz eat > cual 


n 
n>0 


evaluated at points of finite order in the circle group R/Z; that is, they 
are generated by numbers of the form e?7**/" with a, N € Z. Further, the 
action of a Frobenius element a, on the value e27a/N is given explicitly by 
the formula 


(on™) ? _. ¢2tiap/N provided p{ N. 


Thus the Galois action of op is transformed into a multiplication action on 
the circle group. The reader who has a good understanding of this cyclo- 
tomic theory will have no trouble seeing how all of its main elements are 
reproduced in the theory of complex multiplication as described in this sec- 
tion and in §8, albeit with a number of additional technical complications. 

As usual, we let Rx be the ring of integers in a quadratic imaginary 
field K, and let E be an elliptic curve with complex multiplication by Rx. 
We will always assume that the isomorphism [-] : Rx — End(£) is nor- 
malized as in (1.1). 

We begin with an important lemma which tells us when an endomor- 
phism of the reduced curve EF (mod) actually comes from an endomor- 
phism of E. 


Lemma 5.2. Suppose that EF is defined over the number field L, let % 
be a prime of L at which E has good reduction, and let E be the reduction 
of FE modulo 8. Let 

6: End(£) —> End(£) 


be the natural map which takes an endomorphism to its reduction mod- 
ulo %. Then for any y € End(£), 


y € Image(@) <> y commutes with every element in Image(6). 
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In other words, Image(@) is its own commutator inside End(E). 


PrROooF. One direction is trivial, since if y € Image(@), then y certainly 
commutes with elements of Image(@). This is immediate from the fact 
that End(£) © Rx, which implies that Image(@) is a commutative ring. 

For the other direction, we first note that @ is injective from (4.4). 
From [AEC III.9.4], End(£) is an order in either a quadratic imaginary 
field or in a quaternion algebra. If it is an order in a quadratic imaginary 
field, then @ is an isomorphism, since, by assumption, End(£) & Rx is the 
maximal order in K. So in this case we are done. 

Next we consider the case that End(£) is an order in a quaternion 
algebra H. Then Image(@) ® Q is a quadratic subfield of H, call it K. 
(Note K & K, but it is possible for H to contain several distinct subfields 
each isomorphic to K.) We start by choosing a Q-basis {1,a} for K such 
that a? € Q; and then we extend it to a Q-basis for H of the form 


H = Q+ Qa + O86 + Qags 
satisfying 
a’, 8", (aB)? €Q and a8 = —Ba. 
(See the proof [AEC III.9.3].) Now it is easy to find the commutator of K 


in H. For any y € H, we write y = d+aa+b6G+ caf with a,b,c,d € Q 
and compute: 


7 commutes with K 
Ya = ay 
(d+ aa+t+ bB + caB)a = a(d+ aa + bB + cap) 
da + aa? + bBa+ caBa = da + aa? + baf + ca?B 
—ba — ca? B = baB+ca’?Z since a3 = —Ba 
b=c=0 

since a? € Q and {1,a, 3, a3} is a Q-basis for H 
— y=d+aaeQ+Qa=X. 


frat 


Finally, let 6 € End(E) commute with Image(@). Then 6 commutes 
with K, so from what we have just done, 6 is in K. But we also know that 6 
is integral over Z and that Image(@) & Rx is the maximal order in K = k, 
hence 6 € Image(@). This completes the proof of Lemma 5.2. Oo 


As usual, let E be an elliptic curve with complex multiplication by Rx. 
In the last section (4.3a) we proved that 


H = K(j(B)) 


is the Hilbert class field of K. Since j(E) € H, this means we can find an 
equation for E with coefficients in H, so we may as well assume that EF is 
defined over H. The next proposition says that we can lift the p**-power 
Frobenius map E > E) to a map in characteristic 0. 


§5. The Maximal Abelian Extension 131 


Proposition 5.3. Let K be a quadratic imaginary field, H the Hilbert 
class field of K, and E/H an elliptic curve with complex multiplication 
by Rx. Let op € Gy/K be the Frobenius element associated to a prime p 
of Rx, and let B be a prime of H lying over p. Assume that p has degree 1 
and is not in the finite set of primes specified in (4.5), so in particular E 
has good reduction at $8. Then there exists an isogeny 


AX: BE — EB 
whose reduction modulo $B, 

A: B— E®), 
is the p**-power Frobenius map. 


Remark 5.3.1. In general, there is no reason to expect an elliptic curve 
to be isogenous to one of its Galois conjugates. Of course, there are always 
maps 

EB Ev 


| ! 


ee p'? power 


Boe Fe pe) 
Frobenius 


where the vertical maps are “reduction modulo $B.” The content of Propo- 
sition 5.3 is that there is an isogeny 1: E — E> which makes this picture 
into a commutative square. Thus X lifts the Frobenius map from charac- 
teristic p to characteristic 0. 

ProoF (of Proposition 5.3). To ease notation, we will write o in place 


of gp. From (4.5) there is an isogeny E — p * FE whose reduction E —- 
pee is purely inseparable of degree p. Composing this isogeny with the 
isomorphism p * & = E? provided by (4.3), we get an isogeny A: ERO Ev? 
which is purely inseparable of degree p. It follows from [AEC III.4.6] that 
factors as 
Bb *, BO), Be. 

where ¢ is the p**-power Frobenius map and deg ¢ = 1. But, by definition, 
the reduction of E? is precisely E“), so ¢ is an automorphism of E*. If we 
can show that ¢ is the reduction modulo $B of some eg € Aut(E7), then we 
can replace A by €9 1X and be done. So we need to prove the 


Claim: ¢ lies in the image of Aut(E7) inside Aut(E?). 
From (5.2), it suffices to show that ¢ commutes with the image of End(E7) 
inside End(E7). (This will allow us to lift ¢ to an €9 in End(£7), and 


then (4.4) will imply that ¢9 has degree 1, so it is in Aut(EZ7).) Recall that 
we have normalized isomorphisms 


[-le : Rr = End(£) and [-]ze : Rr = End(E”), 
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and that from (1.1.1) these isomorphisms satisfy 
0° [ale = [alee od for alla € Rx. 


Next we look at the reduction of [a] modulo 8. In general, suppose 
that f : V — W is any rational map of algebraic varieties over a field k 
of characteristic p, let dv : V ~ V®) and dw : W — W®) be the p*- 
power Frobenius maps, and let o € Aut(k) be the pt-power Frobenius 
automorphism of k. Then f7 : V‘?) + W) is a rational map and 


dwof =f? ody. 


To see that this is true, write f = [fo,..., fn] (locally) as a map given 
by homogeneous polynomials. The desired result then follows from the 
observation that for a polynomial f(x) = f(71,...,2m) = }>a;x", we have 


\P 
£7 (60) = Do aPx'” = (Slax!) = o(f(x)). 
We now apply this general fact to the map (ale : E > E. Note that 
[ale = [alps 
from (2.2a), since o € Gy x fixes a € K. Thus we get 


$0 [ale = [alZo¢ = [alae 04. 


Using this, we compute 


[alze 0606 = [alge OK since cog =A 
=o [a] from above, 
=ecogolalg 
= e0lalne o¢ from above. 


Therefore [a] ge o€ = €0[a]g¢, which completes the proof of our claim and 
with it the proof of Proposition 5.3. Oo 


An important special case of Proposition 5.3 occurs when the ideal p 
is principal, in which case op = (p, H/K) = 1. Then A is an endomorphism 
of E. We can identify that endomorphism quite precisely as follows. 
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Corollary 5.4. Let K bea quadratic imaginary field, H the Hilbert class 
field of K, and E/H an elliptic curve with complex multiplication by Rx. 
For all but finitely many degree 1 prime ideals p of K that satisfy 


(p, H/K) = 1, 


there is a unique 7 = Tp € Rx such that 
[7] 


3] 


p=7Rk, and 


p' power = 


Rh — & 
(Gans 


Frobenius 
is a commutative diagram. (Note that the condition (p,H/K) = 1 is 
equivalent to p being a principal ideal.) 


PROOF. Let ‘B be a prime of H lying over p. Having excluded finitely 
many p’s, including those for which E (mod 9B) is singular, we may use (5.3) 
to obtain a commutative diagram 

» 


E — EH 
| | 
E 2. Be), 


Here o, = (p, H/K), 2 is an isogeny, ¢ is the p'*-power Frobenius map, 
and the vertical maps are reduction modulo 9B. 

Our assumption that (p,H/K) = 1 means that EF? = E, so Xi s really 
an endomorphism of E, say A = [x]. It also implies that E® = . Thus 
we have a commutative diagram 


[7] 
— 


E E 
ora 8 
Now we compute 
NO p=p since p has degree 1 
= deg since ¢ is p*” power Frobenius 
= deg [7] from (4.4), since [7] = 
= INS z| from (1.4b). 


Since p is a prime ideal in the quadratic field K, this means that either 


p=7RxK or p=7'Rx, 
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where 7’ is the Gal(K/Q)-conjugate of 7. We can use the fact that (£,[-]) 
is normalized to check which one it is. 

We take an equation for E./H with good reduction at PB and let w € O¢ 
be a non-zero invariant differential whose reduction @ is a non-zero invariant 
differential on E. Then the normalization (1.1) says that [z]*w = mw, so 


to = Tw = [n)]*w = [r]*o = *o =0. 


The last equality follows from [AEC II.4.2c], since the Frobenius map ¢ is 
inseparable. Now 9 is a one-dimensional vector space generated by w, 
so 7 = 0. In other words, 


nc =0 (mod $), 


so 7 € BOK =p. Since we saw above that p equals either Rx or nr’ RK, 
we conclude that p = Rx. This finishes the existence half of (5.4). 
To see that 7 is uniquely determined, we need merely observe that the 
composition 
Rx +, End(E) — End(E) 
is injective. Since 7 is required to satisfy [7] =@oeE End(E), there is at 
most one such 7. Oo 


Our goal is to show that the torsion points of an elliptic curve EF with 
complex multiplication by Rx can be used to generate abelian extensions 
of kK. It would be nice if the torsion points themselves should generate 
abelian extensions of K, but unfortunately it turns out that they only 
generate abelian extensions of the Hilbert class field H of K. In order to 
pick out the correct subfield, we take a model for FE defined over H and fix 
a (finite) map 

h: E — E/Aut(E£) =P’ 


also defined over H. Such a map h is called a Weber function for E/H. 


Example 5.5.1. If we take a Weierstrass equation for E of the form 
2—=734Ar+B with A,BeH, 


then the following is a Weber function for E'/H: 


x if ABO, 
h(P) = A(z, y) = a? if B=0, 
a if A=0. 


So in essence, except for the two exceptional cases 7 = 0 and j = 1728, a 
Weber function is just an x-coordinate for the curve. 
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Example 5.5.2. It is also possible to define a Weber function analytically 
in such a way that we don’t have to worry about fields of definition. For 
example, if we choose a lattice A and an isomorphism 


f:C/A——> E(C), z+ (p(z, A), @'(z,A)), 


then the following is a Weber function for EF: 


BE o(e, 4) if j(E) 4 0, 1728, 
2 
h(f(z)) = ao e(z, A)? if (EZ) = 1728, 
ae e(z, A)3 if j(E) =0. 


Here A(A) = go(A)*—2793(A)? 4 0 is the usual modular discriminant. The 
reader may easily verify that this Weber function is model independent; 
that is, it does not change if we take a new lattice for F, or equivalently 
a new Weierstrass equation for EF. Since we know from (4.3a) that it is 
possible to find an equation for E defined over H, it follows from the model 
independence that this Weber function h: E — P! is defined over H. 


To generate abelian extensions of K, we will use the values of a Weber 
function on torsion points, which essentially means we will take the z- 
coordinates of the torsion points. Recall from §1 that for any integral 
ideal c of Rx we defined the group of ¢-torsion points of F to be 


Efe] ={PEE: [y]/P =0 for all y Ec}. 


The reader is advised to compare the following theorem with the cyclotomic 
theory discussed at the beginning of this section. 


Theorem 5.6. Let K bea quadratic imaginary field, let E be an elliptic 
curve with complex multiplication by Rx, and let h: E > P! be a Weber 
function for E'/H as described above. Let ¢ be an integral ideal of Rx. 
Then the field 


K (j(E), h(Ele])) 
is the ray class field of K modulo c. 
Corollary 5.7. With notation as in Theorem 5.6, 
K*> = K(j(E), h(Etors))- 


In particular, if j(E) # 0,1728 and if we take an equation for E with coef- 
ficients in K(j(E)), then the maximal abelian extension of K is generated 
by j(E) and the x-coordinates of the torsion points of E. 


PROOF (of Theorem 5.6). Let 
L= K (j(E), h(E[e])). 
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Then L > K(j(£)), and from (4.3a) we know that K(j(£)) = H is the 
Hilbert class field of K. In order to show that L is the ray class field of K 
modulo ¢, we need to prove that 


(p,L/K)=1 — > pe P(c). 
As usual, it suffices to prove this for all but finitely many degree 1 primes 
in Kk. 
Suppose first that p is a degree 1 prime of K with p € P(c). This 
means that 


p= pRx for some uw € Rx with pw = 1(mod c). 


In particular, p is principal, so (p,H/K) = 1. Hence we can apply (5.4) 
(after excluding finitely many p’s) to get some a € Rx such that 
[r] 


E— E 
p=7RxK and | | commutes. 
BF 


Since 7Rx = p = uRx, there is a unit € € Rj, such that 7 = Ep. Notice 
that [€] € Aut(£), so [7] and [yu] differ by an automorphism of E. 

We already know that (p,L/K) fixes H = K(j(E)), so in order to 
show that it fixes all of L, we must show that it fixes h(E[c]). Let T € E[c] 
be any c-torsion point. Then the commutative diagram gives 


TOLIK) = §(P) = [xT 


On the other hand, [AEC VII.3.1b] tells us that the reduction map E > E 
is injective on torsion points whose order is prime to p. So if we exclude from 
consideration the finitely many p’s which divide #E[c], then the reduction 
map 

Ec] — E[¢] 


is injective. Therefore 
TPL/K) — alP. 


Now we compute 
A(T)@L/K) = h Ce since (p, H/K) = 1 and 
h: E > P! is defined over H 


= h{[r]T) from above 
=A oldT) since x = &y 
= h((y|T) since h is Aut(F)-invariant and 


[é] € Aut(E) 
=h(T) since T € E[c] and p = 1(mod ¢). 
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This completes the proof that 
p € Plc) = (p,L/K) =1. 


In order to prove the converse, we take a prime of degree 1 satisfy- 
ing (p, L/K) =1. Then 


(, H/K) = (p,L/K)|, =1, 


so (excluding finitely many p’s) we can apply (5.5) as usual to get aa © Re 
such that 


In] 


EK — &£F 
p=7RK and | | commutes. 
Bee oe 


We also choose some o € Gx x whose restriction to K*? is (p, K*?/K). 
Then in particular ol, = (p, L/K) = 1, and also o|y =1 since H Cc L. 
Now let T € E[c] be any c-torsion point. We compute 


= h(o(T)) from the commutative diagram 

=h (7") since o reduces to p*’ power Frobenius 
= h(T*) 

- h(T)? since o|4 = 1 and h is defined over H 
= h(T) since h(T) € L and oly = 1 

=h(T). 


Next we observe that the reduction of h modulo $B is the map 
h: 6 — B/AutE & B/Aut EB. 


(N.B. The image is not E/ Aut FE, since Aut E may be larger than Aut(E).) 


It follows from this and the equality h((n|T) = h(T) proven above that 
there is an automorphism [€] € Aut(F) such that 


[n]T = (elf. 


Again using the injectivity of the torsion Ee] @ Ele] from [AEC VIL.3.1b], 
we find that [x — €|T =O. 

A priori, the particular € for which [7 — €]T = O might depend on T. 
But from (1.4b) we know that E[c] is a free Rx /c-module of rank one. 


138 II. Complex Multiplication 


Hence there is a single € € Ri such that [7 — €] annihilates all of E[c], 
which implies that 7 = € (mod c). Therefore 


€-'t=1 (mod ¢), 


and of course we have p = Rx = (€~!m)Rx since € is a unit. This proves 
that p € P(c), which completes the proof of Theorem 5.6. QO 


PROOF (of Corollary 5.7). Let L/AK be any finite abelian extension and 
let cz/x be the conductor of L/K. By class field theory (3.2c), L is con- 
tained in the ray class field of K modulo ¢z/x. Using (5.6), this means 
that 

LC K(j(E), A(Elersx]))- 


Taking the compositum over all conductors gives L C K(j(F), h(Etors)), 
and then taking the union over all L’s gives K*? Cc K(j(E),h(Etors))- 
But (5.6) says that K(j(E),h(Etors)) is a compositum of abelian exten- 
sions, hence it is abelian, hence it equals K*?. This completes the first part 
of (5.7). 

The second part of (5.7) is then immediate from (5.5.1), which says 
that if 7(£) € 0,1728, then the x-coordinate on a Weierstrass equation 
for E/Q(j(E)) is a Weber function for E. Oo 


Example 5.8. Corollary 5.7 raises the obvious question of what happens if 
we adjoin all of Fyors to K, rather than just the values of a Weber function. 
In general one does not get an abelian extension of K, although we have 
seen (2.3) that Eyors generates an abelian extension of H. (The reader 
might try to use (5.4) to construct another proof of this fact.) Suppose 
now we look at the special case that K has class number 1, so H = Kk. 
Then we have inclusions 


Kea An hid) ) CR en) GH Sie. 
Thus 
K has class number1 => K®? = K(h(Etors)) = K(Etors): 


The j-invariants of these curves will be in Q. For a complete list of all CM j- 
invariants in Q, together with representative Weierstrass equations, see 
Appendix A 83. 


Example 5.8.1. We will illustrate (5.6) and (5.8) with the curve 
E:y=e+a 


which has complex multiplication by the ring of Gaussian integers Z/i] in 
the field K = Q(z). Clearly 


[2] = {O, (0,0), (+#,0)}, 
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so K(E[2|) = K. One can easily check that the ray class field of K mod- 
ulo 2 is K, so this confirms (5.6). 

Next we look at points of order 3. Letting T = (z,y) € E, the dupli- 
cation formula reads 

ie ate? +h x8 + 524 — 5a? —1 
4y?2 8y3 
So setting 2(2T) = x(T), we find (after some algebra) that 
BD a ea See Or = SO 


The four roots of this equation are 


a, —a : : here @ ees 
o) ’ a ’ Ww a ae aa 

Via’ V3a 3 
Since the Weber function on E is h(z,y) = x, this gives K(h(E[3])) = 
K (V3), which the reader may verify is indeed the ray class field of kK 
modulo 3. 

Substituting these four values for x into y? = 2° + 2 and solving for y, 
we find the y-coordinates of the points in [3]. If we let 


ae - [4 


then the nine points in E[3] ar 


Since K = Q(z) has class number 1, (5.8) says that the field A (3) is an 
abelian extension of K, but it is not necessarily a ray class field. We leave 
it for the reader to check directly that K(3)/K is abelian. 

Next, T = (az, y) is a point of exact order 4 if and only if y(2P) = 0. 
Using the duplication formula given above, if we let y = (2 —1)i, then 
the x-coordinates of the points of order 4 satisfy 


0-28 + Sat — 5a? 1 = (eet Ye — Yet y@-V\et7"), 


Hence K(h(E[4])) = K(y?) = K (V2), which is the ray class field of K 
modulo 4. Finally, if we let 6 = (1 + i) (2 — 1), then we find that 


E{4] = {o, (0,0), (+i, 0), (1,£V2), (7, +6), 
Gm ye Goma) 2 


So in this case K (E[4]) = K(y7,6) = K (V2) is equal to K(h(E[4])). 
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§6. Integrality of 7 


We have seen (2.1b) that the j-invariant of an elliptic curve E with complex 
multiplication is an algebraic number. In this section we are going to 
prove that j(£) is in fact an algebraic integer or, equivalently, that E has 
everywhere potential good reduction. The results of this section will not 
be needed until 810, so the reader primarily interested in the relationship 
between complex multiplication and class field theory may wish to skip 
directly to 87. 


Theorem 6.1. Let E/C be an elliptic curve with complex multiplication. 
Then j(£) is an algebraic integer. 


We are going to give three proofs of this important fact, two in this section 
and a third in (V.6.3). In order to help the reader understand the different 
approaches used in these three proofs, we will start with a brief description 
of each. 


The Complex Analytic Proof 


Let A; and Ag be lattices corresponding to elliptic curves £,/C and F2/C, 
and suppose that E; and E> are isogenous. Then we will show that j(E)) 
and j(&2) are algebraically dependent over Q by explicitly constructing 
a polynomial F(X,Y) € Z[X,Y] with F(j(£\),j(Z£2)) = 0. If E has 
complex multiplication, then by taking EF, = By = E we will obtain a 
monic polynomial with j(£) as a root. Thus we show that j(£) is integral 
over Z by explicitly constructing a monic polynomial with j(£) as a root. 
This proof has the advantage of being very explicit, and the disadvantage 
that it does not generalize to higher dimensions. 


The £-adic (Good Reduction) Proof 


This proof, which is due to Serre and Tate [1], readily generalizes to abelian 
varieties of arbitrary dimension. The idea is to use the criterion of Néron- 
Ogg-Shafarevich [AEC VII.7.3] to prove directly that E has potential good 
reduction at all primes, which implies by [AEC VII.5.5] that j(£) is in- 
tegral at all primes. Thus let L be a local field and E/L an elliptic 
curve with complex multiplication. We have seen (2.3) that the action 
of Gal(L/L) on the Tate module T;(£) is abelian. (For another proof of this 
fact that uses nothing more than a little linear algebra, see exercise 2.6.) 
In other words, Gal(L*?/L) acts on T)(E). Next we use the description 
of Gal(L*>/L) provided by local class field theory to show that the action 
must factor through a finite quotient of Gal(L*?/L), which allows us to 
apply [AEC VIL.7.3]. 


The p-adic (Bad Reduction) Proof 


For this proof, which is due to Serre, we assume that j(£) is not integral at 
some prime p and prove that & has no non-trivial endomorphisms. Let L 
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be a complete local field with maximal ideal p, and let E'/L be an elliptic 
curve whose j-invariant is non-integral at p. We will show (V §§3,5) that 
after replacing L by a quadratic extension, there is an element q € L* and 
a p-adic analytic isomorphism of groups 


Lg? = BCL), 
Using this isomorphism, we construct (V.6.1) an element of Gal(Z/L) which 


acts on the Tate module of & via the matrix (4 t) (relative to a suitable 


basis). Then the fact that endomorphisms commute with the action of Ga- 
lois will allow us to conclude that there are no non-trivial endomorphisms, 
so FE does not have complex multiplication. 


It is worth remarking that the second and third proofs are local; one 
shows that j(£) is integral by working one prime at a time. The first proof, 
on the other hand, is more global in nature. We are going to give the first 
two proofs in this section. For proof number three, see (V.6.3). 


Example 6.2.1. Note that the three elliptic curves in (2.3.1) possessing 
an endomorphism of degree 2 all have j-invariants in Z, as they should 
from (6.1), since the corresponding quadratic imaginary fields have class 
number 1. More generally, if K has class number 1, then j(Rx) will be a 
rational integer. 

As is well known, there are only nine quadratic imaginary fields of 
class number 1, a fact conjectured by Gauss and proven by Heegner [1]. 
(See also Baker [1] and Stark [1].) These fields are 


Q(V=1), Q(vV=2), @(v-3), Q(v-7), Q(v-M), 
Q(v-19), Q(v-43), Q(v—67), Q(v—163). 


A list of the corresponding j-invariants is given in Appendix A 83. It follows 


for example that 
14+ /-—163 


Recall (I.7.4b) that j(7) has the g-expansion 
1 
j(q) = ; + 744 + 196884q + 21493760q? + --- , 


where q = €?"’7. If we substitute 7 = (1 + /—163) /2, then 


q=—e 7V16 =~ —3.809 - 10718 


is very small. Thus the main term in j(q) will be 1/g, which means that 1/q 
should be “almost” an integer. Computing 1/q to 40 significant digits we 
find that 


e7¥ 163 — 262537412640768743.999999999999250072597 . .., 
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V163 is an integer to 12 decimal places. Of course, we know a priori 
that e™ V1 is not an integer and in fact is not even an algebraic number, 
since the Gel’fond-Schneider theorem says that e™* = (—1)~** is transcen- 
dental whenever ta is algebraic of degree at least 2 over Q. 


so e” 


Example 6.2.2. Now let’s look at an example with class number larger 
than 1. For example, consider the field KA = Q(V—15) and its ring of 


1+VJ-15 It 


is not hard to check that Rx has class number 2 and that a non-trivial ideal 
class is given by a = 2Z+ QZ. Further, one can check that the field H = 
K (V5, /—3) is everywhere unramified over K, so it is the Hilbert class 
field of K. (See exercise 2.11.) It follows from (4.3) that H = K(j(Rx)) 
and that QU (Rx)) is a quadratic extension of Q contained in H and 
disjoint from K. Hence Q(j(Rx)) is either Q (v5) or Q(V—3). We will 
see in a moment that j(Rx) € IR, so we must have Q(j(Rx)) = Q(v5). 
(This also follows from exercise 2.9, which says in general that j(c) € R if 
and only if ¢? = 1 in CL(Rx).) 

It remains to compute j(Rx) explicitly as an element of Q (v5 3 Let A 
and B be rational numbers so that 


integers Rx = Z[a], where to ease notation we will write a = 


j(Rx) = A+ BV5. 
From (4.3c) we see that j(a) is the Gal(K/K)-conjugate of j(Rx), so 
j(a) = A— BV5. 
Solving these two equations for A and B gives 


_HRK)+I@ yp — SRK) = 50) 


2 2/5 


In order to compute the two values of 7 numerically, we can use the g- 
series (I.7.4b), where g = e?*7 is the parameter for the normalized lat- 
tice Z+ Zr: 


A 


iQ = 2 7444+ 196884q + 214937604? + 864299970g" + 20245856256q* 
q 


+ 333202640600q° + 4252023300096q° + 44656994071935q" +-:-. 
Thus for Rx we find that 


G(Rx) = j(Z + a) = j (e?t*) = j(-e-¥) 
= j (—5.19748331238 - 107°) ~ —191657.832863. 
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(Notice in this case that q = —e~V18" € R, so j(Rx) € R.) Similarly for a 
we calculate 


1 
ila) = j(2B-+ a2) = j (2+ 502) = j(eVr% 
= j (2.27979896315 - 10731) = 632.83286254. 


Using these values gives (to 12 significant digits) A = —95512.5000002 
and B = —42997.5000001, so 


j(RK) & —95512.5 — 42997.5V5 = —52515 — 85995 5 


1 
IIE ay +V5 | 
2 
Thus j(Rx) is (at least approximately) integral over Z, which gives a nu- 
merical verification of Theorem 6.1 for this example. 


The Analytic Proof of Theorem 6.1 


Before beginning, we give a few words of motivation. It is not hard to see 
that an elliptic curve E has complex multiplication if and only if there is 
an endomorphism £ — EF whose degree is not a square. This suggests that 
we take an arbitrary elliptic curve FE and a positive integer n and study 
the set of all elliptic curves E’ for which there is an isogeny E — E’ of 
degree n. We took this point of view in (I §9,10) when we studied Hecke 
operators. What we are going to do is show that in this situation j(E’) is 
integral over Z\j (E)]. We will do this by explicitly constructing a monic 
polynomial F,, (j(£), X) with coefficients in Z[j(E)] having j(E’) as a root. 
Finally, if F has complex multiplication, then for an appropriate choice of n 
we can take E’ = E£. This means that F,(j(E),j(E)) = 0, which we will 
show implies that j(F) is integral over Z. 

We now begin the analytic proof of Theorem 6.1. We fix a positive 
integer n and recall the sets of matrices D, and §,, defined in (I §9): 


D.={(° 2) € M2(Z) : ad~be=n}., 


8. ={(5 1) €Ma(@) + ad =n, d>0,0<b<ah. 


We also recall (1.9.2), which says that 8, = SLo(Z)\D,. For any ma- 


trix a = & a) € M2(R) with deta > 0, we define the function j 0 a as 


: _fat+b 
poor) =3 cet+d/)- 


usual by the formula 
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Of course, if a € SL2(Z), then joa = 7. We are going to study the 
polynomial 
F,(X) = [] (X -joa) =o 5m_x™ 
ac$n m 
whose coefficients $,,, = 8m(7) are holomorphic functions on the upper half- 
plane H. More precisely, s;, is the m*® elementary symmetric function in 
the 7 0 a’s. We are going to prove several claims concerning the s,,’s. 


Claim 1: 8m(YT) = Sm(T) for all y € SL2(Z) and all 7 € H. 


Let y € SLo(Z). For any a € §, we have ay € Dy, so (1.9.2) says that 
there is a (unique) 6, € SL2(Z) such that é,ay is back in §,. Further, 
if d.ay = 6387 for some 8 € &,, then 6 = (53 6a)a, so (1.9.2) implies 
that a = Z. In other words, the map 


8n — 8n, at— bay, 


is one-to-one, hence is a bijection since 8, is a finite set. 
Now we observe that 


{jo(ay) : a@€8,} = {j06z' 0 (bsay) : aE 8,} 
= {j 0 (dgary) : @E Sn} since 7 is SL2(Z)-invariant 
={joa:a€8,} since 8, = {day : a € 8}. 


Hence any symmetric function on the set {joa : a € §,} will be invariant 
under 7 +> y7 for y € SLe(Z). In particular, this applies to the s,,,(7)’s, 
which completes the proof of Claim 1. 


Claim 2: 8m, € Cj]. 


In other words, we are claiming that there is a polynomial f,,(X) € C[X] 
such that 8m(T) = fm(j(7)) for allt € H. From Claim 1 we know that s,, is 
holomorphic on H and is SLo(Z)-invariant. In particular, s,(7+1) = sm(T), 
SO Sm has a Fourier expansion in g = e?7*7. We want to study what happens 
as T — too, or equivalently as q — 0. Recall (1.7.4b) that 7 has the Fourier 
expansion 7 = q7! + ese ceg*, so j has a pole of order 1 at ¢ = 0. Now 


ifa= (8 i) € Sy, then 


ori anh 0° 2mik th 
joa(r) =e + y Cre , 
k=0 


so in particular g"t1(j oa)(7) > 0 as q > 0. It follows from the definition 
of the s,,’s that there is an integer N such that q%s,,(T) > 0 as q > 0. 
This means that each s,,(7) is meromorphic at oo (see I §3), so sm is a 
modular function of weight 0 which is holomorphic on H. Now (I.4.2b) says 
that s,, € C[j], which completes the proof of Claim 2. 
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Claim 3: The Fourier expansion of s,, has coefficients in Z. 
To ease notation, let ¢ = e27/" and Q = qi/" = e?7"7/"_ For any a = 
(6 i) € 8, we have 


ori Beal : 
qoa(r) =e = Coe : 
(Note that ad = n.) Using the q-expansion of j(7) (I.7.4b), we find as 
above that 7 oa has the Q-expansion 


CO 
j oa(t) = Co + So eons, 
k=0 
where cp,¢1,... are integers. In particular, the Fourier coefficients of j oa 
lie in Z[¢], and so the same is true of the s,’s. 
Let o € Gal(Q(¢)/Q), and write ¢7 = ¢r(7) for some integer r(c) 
relatively prime to n. If we apply o to the Q-Fourier coefficients of 7 0 a, 
we get the series 


oo 
G 6 a)? - Cor(a)abg-a? 4 yee. 


k=0 


Comparing the series for 7 oa and (j 0 a)’, we see that 


(9(5 a) =92(5 2"): 


In general, the value of jo (4 b only depends on b (mod d), since 
Od 


GCG 2-04 


and j is SLa(Z)-invariant. Further, if r is any integer prime to n = ad, 
then the set {rb : 0 < b < d} is a complete set of residue classes modulo d. 
It follows that for any integer r relatively prime to n we have 


1i°(5 dn : é 1) Sob ={Gea: ae Sy}, 


Applying this with r = r(c) for 7 € Gal(Q(¢)/Q), it follows that 
{(joa)? : a€8,} = {joa:a€8,}. 


Now consider the Q-Fourier coefficients of the s,,(7)’s, which we know 
from above lie in Z[¢]. Since sm(7) is a symmetric polynomial in the 
functions {j o@ : a € §,}, we see that its Q-Fourier coefficients are fixed 
by Gal(Q(¢)/Q) and so lie in Q. Hence the Fourier coefficients of s,,(r) 
are in Z[¢]NQ = Z. Finally, we note that o,,(7+1) = 0,,(7) from Claim 1, 
SO Om is in fact represented by a Fourier series in g = Q”. This completes 
the proof of Claim 3. 
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Claim 4: S(T) € Z[j]. 


We already know from Claim 2 that s,, € C[j] and from Claim 3 that 5, € 
Z[q,q~']. We will show that 


Cl] ZI, q7"] = Zh], 

which will give the desired result. Let f(j) € Clj]M Zlg, q~'] be a polyno- 
mial of degree d, and write f(j) = aoj4 +a,j4-! +--+ + a4 with a; € C. 
Substituting in the g-expansion of 7 (1.7.4b) gives 

ao ay + 744dag 

f = + — 4+ --s—" 

qi qe 

so the fact that f € Z[¢,q~*] implies that aj € Z. Now 
f - oj? = aft" +++ +04 € Ci] NZ, q7 "1, 


so repeating the above argument gives a; € Z. Continuing in this way, we 
find that every coefficient of f in Z, which completes the proof of Claim 4. 


Combining Claims 1, 2, 3, and 4, we have completed the proof of the 
first half of the following important result. 


Theorem 6.3. (a) There is a polynomial F,,(Y,X) € Z[Y, X] so that 


[] % -f°0) = FG. X). 

ae8n 
(b) Let @ € M2(Z) be a matrix with integer coefficients and det G > 0. 
Then the function j 0 8 is integral over the ring Z[j]. 
(c) Ifn is not a perfect square, then the polynomial H,(X) = F,(X,X) 
is non-constant and has leading coefficient +1. 


Proor. (a) The four claims proven above say that 


ll (X —joa) =) > smX™ with sm € Z[j]. 
aeS8n m 

(b) Let nm = det 6, so G € D,. Using (1.9.2), we can find a matrix y € 
SL2(Z) such that 73 € 8,. The SL2(Z)-invariance of j says that jo 6 = 
jo (y¥8), while the definition of F,, shows that X = j o (yf) is a root 
of F,,(j,X). Since F;, is monic by definition and has coefficients in Z[j] 
from (a), it follows that j o @ is integral over Z|j]. 

(c) Leta = (6 i) € 8,. Then using the Q-expansion of 7 o a described 
above during the proof of Claim 3, we see that the Q@-expansion of 7 — joa 
is 


i} 400V= i = nk \ TAS — abk —a*k ; 
j-joa (+ 300") (cage + a6 Q 
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(Here we are again writing ¢ = e?7/" and Q = q'/".) Since n is not a 
square, the leading terms cannot cancel, so 7 — 7 9 @ has a pole as Q — 
0 and the coefficient of the leading term is necessarily a root of unity. 
(Precisely, the coefficient is 1 if n > a?, and it is —C~?° otherwise.) It 
follows that F),(j,7) has a pole as @ — 0 and that the leading @-coefficient 
is a root of unity. But the Q-expansion of F;,(j, 7) has integer coefficients, 
so the leading coefficient is a root of unity in Z; hence it must be +1. 
Further, F,,(j,7) is actually a series in g = Q”, so we have proven that 


bs 1 4 
Fr, 5) = oor +---€q "Z{q] 


for some m > 1. But we also know that F,(j,7) € Zly] and that j has a 

simple pole at g = 0. Hence F,(j,j) = 47 +--- € Z[j], which proves 

that F,,(X,X) is a non-constant polynomial with leading coefficient +1. 
oO 


It is now a simple matter to complete the proof of Theorem 6.1. 


Corollary 6.3.1. (Theorem 6.1). Let E/C be an elliptic curve with 
complex multiplication. Then j(E) is an algebraic integer. 


Proor. Let R & End(£) be an order in a quadratic imaginary field K. We 
consider first the case that R = Rx is the ring of integers of kK. Choose 
some element p € R such that n = INS | is not a perfect square. For 
example, if K = Q(i), take p = 1+, and if K = Q(V—D) with square- 
free D > 2, take p = /—D. Then (1.5b) says that the isogeny [op]: E - E 
has degree n. Fix a 7 € H with j(7) = j(F£). Then multiplication by p 
sends the lattice Zr + Z to a sublattice of index n, say 


pT =aTt+b F 
for some a,b,c,d € Z with ad — be = n. 
p=crt+d 


So if we let a = c i) € Dy, then 


ar) = 3 (SE) = str) = 318), 


By definition, jo a is a root of F,(j, X), so if we substitute X = joa and 
evaluate at 7, we get 


0= Fr (i(7), j(ar)) = F,(j(E), j(E)) = H,,(j(E)). 


From (6.3c), the polynomial H,,(X) has integer coefficients and leading 
coefficient +1. This proves that j(£) is integral over Z. 
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Now we deal with the case that R is an arbitrary order in K. Let A = 
Zw, + Zw be a lattice for E. From [AEC VI.5.5] we know that K = 
Q(w/w2). Hence replacing A by AA for an appropriate \ € C*, we may 
assume that A C Rx. We also choose a7 € H so that Ry = Zr +Z. Then 
we can write 
w, =aTt+b 
for some a,b, c,d € Z. 
wg=cT+d 
Let n = ad — be. Switching w, and we if necessary, we may assume 
that n > 1. The matrix a = & q) is in D,, so (6.3b) says that the func- 
tion j 0 @ is integral over the ring Z[j]. Taking the equation F,,(j, X) = 0 
which gives that integrality and evaluating it at 7, we find that j(a7) is 
integral over Z[j(7)]. But j(aT) = j(E), and we already know that j(r) is 
integral over Z because it is the j-invariant of an elliptic curve with complex 
multiplication by Rx. Therefore 7(£) is integral over Z. Oo 


Example 6.3.2. The polynomials F,,(Y, X) € Z[Y, X] and H,(X) € Z[X] 
described in (6.3) can be extremely complicated. For example, 
F,(Y,X) = —(XY)? + X93 +Y% + 24.3-31-XYV(X+Y) 
+ 34.5% .4027- XY — 24.34 .53(X? 4 Y?) 
428 . 37 .5°(X + Y) — 2)%. 39.59, 
Ho(X) = —X*4+2-1489- X°+434-54.17-47-X? 
+29 . 37.58. x — 212. 39.59. 
According to [AEC III.4.5], the elliptic curves E : y? = 2° + ax? + br 
and E’ : y? = x? — 2ax? + (a? — 4b)x are connected by an isogeny E — E” 
of degree 2. It follows from exercise 2.19 that 
. . 256(a? — 3b)? 16(a? + 12d)? 
Fi(j(E), j(£’)) =F = 0, 
2(5(2), (E)) = ( b2(a2 —4b) ’ b(a2 — 4b)? 
a fact that the interested reader can check by a direct computation (prefer- 
ably with the assistance of a symbolic calculator). 


The £-adic Proof of Theorem 6.1 


We now begin the ¢-adic proof of Theorem 6.1. If E/L is an elliptic curve 
with complex multiplication, then we know from (2.3) that the action 
of Gal(L/L) on Etors is abelian, so in particular the action on the Tate 
module T;(£) is abelian. (For another proof of this result which uses only 
a little linear algebra and is valid even if End(£) is not a maximal order, 
see exercise 2.6.) We now use Proposition 2.3, local class field theory, and 
the criterion of Néron-Ogg-Shafarevich to prove that EF has everywhere po- 
tential good reduction. Although somewhat involved, this proof has the 
advantage that it generalizes to abelian varieties of arbitrary dimension 
(see Serre-Tate [1]). 
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Theorem 6.4. Let L be a number field and E/L an elliptic curve with 
complex multiplication. Then E has potential good reduction at every 
prime of L. 


Proor. (Serre-Tate) Every endomorphism of R is defined over a finite 
extension of L (2.2b), so replacing L by a finite extension, we may assume 
that End;,(£) is strictly larger than Z. Fix a prime v of L. We set the 
following notation: 


L, = the completion of L at v, 
R, = the ring of integers of L,, 
Mt, = the maximal ideal of R,, 
p=char R, /2, = the residue characteristic of Ry, 
é =a rational prime not equal to 2 or p, 
I, = the inertia subgroup of Gal(L,/Ly), 
Le = the maximal abelian extension of L,, 
I> — the inertia subgroup of Gal(L2>/L,). 
By assumption, End, (E) # Z, so certainly Endz,(£) is strictly larger 
than Z. Applying (2.3), we see that the action of Gal(L,/L,) on T,(E) is 


abelian. In particular, J, acts through the quotient [2>. 
Local class field theory says that there is an isomorphism 


[a> = R*. 
(See, e.g., Lang [5], Serre [4, XIV §6, Cor. 2(ii) to Thm. 1], Serre [5].) This 
gives us a very good picture of [2>, since we can decompose R* using the 


exact sequence 


LS v,1 — RF > (R,/M%)" — 1. 
— 


pro—p group | 2 finite 
Bee 
Here Rj, is the group of 1-units, 
o1 = {ue RE: u=1(modM,)}. 


There is an isomorphism from the formal multiplicative group Gn (My) 
to Ri, given by 


Gm(My) Ry trol+t+t. 
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Hence Rj, is a pro-p group; that is, it is the inverse limit of finite groups 
of p-power order (see [AEC IV.3.1.2] and [AEC IV.3.2]). 

Similarly, if we fix an isomorphism Aut T(E) = GL2(Z,) correspond- 
ing to some basis for T;(E), then there is an exact sequence 


CS Ge ie, = Ca), =. -CErzi 2. 1 
ee 


pro—é group ||? Il? 


AutTp(E) —> Aut E[é 
——— 
finite 
Here GL2(Zz); is the group of matrices congruent to the identity matrix 
modulo @, and it is not hard to see that this is a pro- group. More precisely, 
the logarithm map gives an isomorphism 


re we Caer 
GL2(Ze)1 —> M2(lZp),  1+€Ar-> log(1+€A)= >> ea aaa 
n=1 


where M]2(Ze) is the group of 2 x 2 matrices with coefficients in Ze under 
addition. This isomorphism is the GL2 analogue of [AEC IV.6.4b]. (See 
also exercises 2.22 and 2.23.) 

It follows from the above discussion that the map 


I, — Aut Te(E) 
fits into the following diagram: 


i 


1 — ney — ie — (R,/M)> — 1 


1 — GLa(Ze)1 —_ Aut T;(£) — GL» (Z/€Z) — 1 


Next we observe that since @ # p, then there can be no non-trivial ho- 
momorphisms from a pro-p group to a pro-¢ group, so the images of Rj, 
and GLo(Z,); in Aut T(E) have trivial intersection. Therefore there is an 
injection 

Image(R} , —> Aut Te(E)) —> GL2 (Z/éZ). 
Since also (R,/,)”* is finite, it follows that 


Image(R* —+ Aut T;(£)) _ is finite, 
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since it consists of finitely many cosets of Image(R*,, —> Aut T;(E)). 
This proves that the image of I, in Aut T;(£) is finite. Now the crite- 
rion of Néron-Ogg-Shafarevich (specifically [AEC VII.7.3]) says that E has 
potential good reduction at v, which concludes the proof of Theorem 6.4. 
O 


It is now a simple matter to deduce (6.1) from (6.4). 


Corollary 6.4.1. (Theorem 6.1). Let E/C be an elliptic curve with 
complex multiplication. Then j(E) is an algebraic integer. 


Proor. The elementary result (2.1b) says that j(£) is an algebraic num- 
ber, so we may take an equation for & with coefficients in the number 
field L = Q(j(E)). Then (6.4) says that E has potential good reduction 
at every prime of L, so [AEC VII.5.5] implies that 7(£) is integral at every 
prime of L. Oo 


§7. Cyclotomic Class Field Theory 


In this section we are going to formulate the class field theory of Q in 
terms of special values of analytic functions, specifically special values of 
the exponential function. This is analogous to the way we will later be 
describing the class field theory of quadratic imaginary fields via the theory 
of complex multiplication. We hope that studying the simpler cyclotomic 
case first will aid the reader in understanding the more intricate proofs 
required in the complex multiplication case. However, the results in this 
section will not be used later, so the reader who already feels comfortable 
with class field theory may wish to skip directly to §8. 
We begin with the multiplicative group 


COR C, 


The exponential map provides a complex analytic parametrization of the 
multiplicative group, 


f: C/Z — Gmp(C) 


t e2tit 


Sitting inside of G,(C) is its torsion subgroup 


Gin(C) tors = f (Q/Z) . 


The elements of Giu(C)tors are roots of unity, so they generate abelian 
extensions of Q. Our aim is to give an analytic description of the action 
of Gal(Q?®/Q) on Gin(C)tors- 
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From class field theory (see §3) we have the reciprocity map, which is 
a surjective homomorphism 


AQ — Gal(Q"”/Q), — s-> [s,Q]. 
Each idele s thus defines an isomorphism 


Gin(C)tors a= Gin (C)tors; 
C os cls Q, 


The algebraic action of Gal(Q®”/Q) on Gr(C)tors is determined by these 
isomorphisms. 

In general, if z € AG is any idele, we want to define a subgroup xZ C Q 
and a multiplication-by-z map 


Q/Z > Q/zZ. 


The definition of xZ is easy; it is just the ideal of x, which we recall is the 
fractional ideal of Q given by 


Z = (x) = || pe" -Z = N,Z. 
Pp 


For convenience, we will write N, as indicated for a rational number gen- 
erating the ideal rZ. Later we will pin down N, precisely by requiring 
that sign(NV,) = sign(z.). 

In order to define the multiplication-by-z map, we decompose Q/Z 
into its p-primary components and multiply the p-component by zp. Note 
that 

(p-primary component of Q/Z) = Z[p~']/Z & Q,/Zp. 


The first equality is immediate, since if t = a/n € Q/Z has p-power order, 
then n must be a power of p. For the second equality, we clearly have an 
injection 

Z|p~*]/Z — Qp/Zp. 
To check surjectivity, let € € Q,/Zp. We can write € = a/p* for some a € 
Zp and some integer e > 0. Choose an integer a € Z with a = a (mod p°Z,). 
Then an a 


pe = oe = € in Q,/Zp. 
Similarly, we observe that for any N € Q*, the p-primary part of Q/NZ is 
isomorphic to Q,/NZp. 

It is a general fact that an abelian group whose elements all have finite 
order is the direct sum of its p-primary components. (See (8.1) and (8.1.1) 
for something stronger.) Hence 


Q/Z = B® /Zp, 
P 


ese Z[p—*]/Z and 
pe 


and for any idele x € AQ, 
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Q/2Z = Q/NzZ = ED Qp/NaZp = GD Qp/zpZo. 
P Pp 


The last equality follows from the fact that ord,(Nz) = ord,(x,), so the 
ideals NZ, and z,Z, are the same. Now we can define the multiplication- 
by-z map to be multiplication of the p-component by zp; in other words, 
multiplication-by-x is defined by the commutativity of the following dia- 
gram: 


Q/Z =. Q/xZ 


[ i 
QB Q,/Zp — BD Q,/2pZp 


Pp 
(tp) —? (Lptp) 
We are now ready for the main theorem of this section. The reader 
should compare this cyclotomic result (7.1) with the corresponding complex 
multiplication theorem (8.2). 


Theorem 7.1. Fix the following quantities: 
o € Aut(C), an automorphism of the complex numbers, 
s € AG, an idele of Q satisfying [s,Q] = alga». 
Further, fix the complex analytic isomorphism 
f :C/Z — G,(C), fH". 
Then there exists a unique complex analytic isomorphism 
f':C/s3Z — Ga(C) 
so that the following diagram commutes: 


Q/Z ~— Q/s'Z 


| [9 


Gm(C) —> Gn(C). 


Remark 7.1.1. Theorem 7.1 says that 
f(ty\eS = f(s“) for allt € Q/Z. 
Of course, f’ depends on s. We will see during the proof of (7.1) that 
Posen, 
where N, is a certain non-zero rational number. Thus written out explic- 
itly, (7.1) says that 


(c2rit) 2 aes e27iNe(s~'t) for all t € Q/Z. 


This formula makes it very clear how the algebraic (Galois) action of [s, Q] 
is transformed into the analytic (multiplication) action t > N,s~'t. Later 
we will have more to say about N,, see (7.2). 
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Proor (of Theorem 7.1). Let ¢ € Q/Z, say t = a/n(mod Z) as a frac- 
tion in lowest terms. To ease notation, let ¢ = f(t) be the corresponding 
primitive n‘*-root of unity. Suppose first that our idele s has the property 

8) =1 (mod nZ,) for all primes p, and further that s.>0.  (*) 
In particular, s, is a unit for all primes dividing n, and we know that Q(¢) 


is ramified only at these primes, so by (3.5) the action of [s, Q] on Q(¢) is 
given by the Artin symbol 


[2, Qloce = ((s), Q6)/Q). 


For any idele s, we will write N, € Q* for the unique rational number 
satisfying 


N,Z = (s) = sZ and sign(N,) = sign(So.). 


Then ((s),Q(¢)/Q) = (NsZ, Q(¢)/Q), from which it follows that ¢l@l = 
CNs, or equivalently 
f(t)Ol = (ey. 


Next we decompose t (mod Z) into p-primary components, 


i= =), a —E DQ /Zp. 
P P 


per 


= from (*), which says that a. —1énZ, = p?Z, 


This suggests that we should take f’ to be the map f, defined by 
fe:C/s'Z——+Gp(Z), — f(t) = e?™*N**. 
Then for any idele s satisfying (*), we have 
fe(s7 it) = f(t) = e27*Net = f(t)Ne = f(t) 2, 
To recapitulate, we have proven that 


f(t)& A = f,(s7 14) provided t = “ € Q/Z and s satisfies (*). 
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Now let s € Ag and t € Q/Z be arbitrary, and as usual write t = 
a/n(mod Z). Using the weak approximation theorem, we can find a ratio- 
nal number r € Q* so that rs satisfies (x). From the definitions, it is easy 
to check that 


Nrs =TNg and (rs)-t =r7}(s7'2). 


Notice that the requirement sign(N,) = sign(so.) ensures that N,;. = rNs, 
even when r is negative. Using these equalities, we compute 


fj = fers? since [rs, Q] = [s,Q 
= frs((rs)~*t) from above, since rs satisfies (*) 


= e2tiNrs(rs)"'t from the definition of frs 


= e2miNe-s't since N;, = rN, and (rs)~'t = r—!(s7't) 


= fae) from the definition of fs. 


This completes the proof of the existence half of (7.1), with the additional 
information that f’ is given by the map f’(t) = f,(t) = e?™*Ns*. As for 
uniqueness, we need merely observe that the commutative diagram deter- 
mines f’ on Q/s~!Z, which is a dense subset of G(C), so there is at most 
one possibility for f’. oO 


As an alternative version of (7.1), we could use only the single analytic 
parametrization f and replace the multiplication-by-s~! map so as to make 
the following diagram commute: 


Q/Z ---> Q/Z 
\s |: 
Gy (C) Gy (C). 


We can (try) to do this because every o € Aut(C) maps Gi(C) to itself. 
In the case of an elliptic curve EF, this will only be possible for those o’s 
such that E° = EF. For the elliptic analogues of our next two results (7.2) 
and (7.3), see (9.1) and (9.2). 


Theorem 7.2. Let s € Ag be an idele. With notation as in (7.1), there 
is a unique rational number N, € Q* such that the following diagram 


commutes: N.s7} 
—  Q/Z 


Q/Z 


Is [3 
Exo) = Ge: 


More precisely, N, is the unique rational number satisfying 


N,Z= (s) = sZ and sign(V,) = sign (Soo). (*) 
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Proor. During the proof of (7.1) we showed that f’ is the map f’(t) = 
e?!Nst where N, is as specified in (x). The commutative square in (7.1) 
then says that 


F(t) = fl(s71t) = e2P*Nos"t = F(N st), 


This proves that the square in (7.2) is commutative with N, chosen to 
satisfy (*). It remains to prove that this commutative diagram uniquely 
determines N,. But if N{ also makes the diagram commute, then we 
find that multiplication by N>!N/ induces the identity map on Q/Z. 
Hence N, = Nj. oO 


From (7.2) we have a well-defined map 
Ag — Oc C, st Ng, 


and it is clear that this map is a homomorphism. Further, the explicit 
description of N, given by (*) in (7.2) shows that the map is continuous. 
Recall that for any number field L, a homomorphism 


x: Az — C 


is called a Gréssencharacter of L if it is continuous and satisfies ,(Z*) = 1; 
that is, the kernel of y must contain the image of L* in Az. It is easy to 
see that our map s +> N, does not have this property. In fact, if s is the 
image of some a € L*, then clearly N, = a. We can get a Grdssencharacter 
by making a small modification to Ng. 


Theorem 7.3. For any idele s € Ag, let 8. be the archimedean compo- 
nent of s. Define a map 


x:Ag—- RCC, Vieha NS 
where N, € Q* is the unique rational number satisfying 
N,Z = (s) = sZ and sign(N,) = sign(soo). 
Then x is a Gréssencharacter of Q. 
PrROoF. It is clear that both of the maps 
st— N, and SE Seg 


are continuous homomorphisms from Ag to C*. Further, they clearly take 
the same value on the image of Q* in Ag. Hence x is a continuous homo- 
morphism that is trivial on Q*; that is, it is a Grdssencharacter. 

O 
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Let K be a quadratic imaginary field with ring of integers Rx as usual. 
For each prime ideal p of K, let Ky, be the completion of K at p and let Ry 
be the ring of integers of K,. Similarly, if a is any fractional ideal of K, 
let ay = aR, be the fractional ideal of K, generated by a. 

Let M be an Rx-module. The p-primary component of M, which by 
definition is that part of M annihilated by some power of p, is denoted by 


M[p®] = {me M : p°m = (0) for some e > 0}. 


We begin with an elementary lemma about p-primary decompositions. 


Lemma 8.1. (a) Let M bea torsion Rx-module; that is, for everym € M 
there is a non-zeroa € Rx such that am = 0. Then the natural summation 
map 


S: Mp SM, 8) = Tap, 
p p 


is an isomorphism. Here the sums are over all prime ideals of Rx, and [bp 
denotes the p-component of pL. 

(b) Let a be a fractional ideal of K. Then for each prime ideal p of K, the 
inclusion K — Ky induces an isomorphism 


T : (K/a) [p*] — Ky/dp. 


(c) Again let a be a fractional ideal of K. Then there is an isomorphism 


K/a = @ Kp/ap. 
p 


Remark 8.1.1. As our proof will show, Lemma 8.1 is true more gen- 
erally for any Dedekind domain R with fraction field K. For example, 
taking R = Z gives the decomposition of a torsion abelian group into p- 
primary components as discussed in 87. 


ProoF (of Lemma 8.1). (a) Suppose first that « € ker(S). For each 
prime p, let e(p) > 0 be the smallest integer such that py, = (0). 
Note that e(p) exists, since up € M[p%], and that all but finitely many 
of the e(p)’s are zero since yz has only finitely many non-zero components. 
Now fix a prime ideal q and let 


v0= I] po), 


pAq 
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By construction, we have df) = (0) for all p # q. On the other hand, 
since S(js) = 0 by assumption, we have 


(0) = 0S(u) =2S 0 up = S_ duty = 1g. 
p p 


But 0 is relatively prime to q, so 0 + q°9) = (1). Hence 


(14g) = (2+ 4°) joy = 2904 + 4° 4g = (0) + (0) = (0), 


which proves that j1qg = 0. Since q was arbitrary, we have proven that = 0 
and hence that S is injective. 

Next we check surjectivity. Take any element m € M, and choose a 
non-zero a € Rx with am = 0. Factor the ideal aR as 


ORK = pips’ +++ pr”. 
Then we can find €1,...,¢, € Rx satisfying 


: F _ f L (mod p§*), 
Ey ter-tep= an c= 0 (mod p*) rae 
[Proof: Let e; = ap~®. Then e; +---+e, = (1), so it suffices to take ¢; € e; 
with €;+---+¢, =1.] Notice that p®e; C aRx, so p*'eym C amR«K = (0). 
Hence e;m € M[p%], so if we set 


_ fem ifp=p,; forsomel <i<r, 
Be) otherwise, 


then « € @M[p%] and S(u) = Soup = crm +---+e,m = m. This 
completes the proof that S is surjective. 

(b) First, suppose that @ € (K/a) [p°] is in the kernel of T. Choosing a 
representative a € K for a, this means that p®a C a for some integer e > 0 
and that a@ € dp = aR,. These two inclusions imply respectively that 


ord,(a@) > ordg(a) for allq Ap and ord,(a) > ord,(a). 
Therefore aRx C a, so a € a, which means that @ = 0. This proves that T 

is injective. 7 
Next, let 6 € K,/ap and choose a representative 6 € K, for 3. By the 


weak approximation theorem (essentially the Chinese Remainder Theorem) 
we can find ana € K satisfying 


ord,(a — 8) > ord,(a) and ord,(a) > ordg(a) for allq fp. 
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The first inequality says that a = 3 (mod ay), so T(@) = 3. Let e€ be a non- 
negative integer greater than ord,(a) — ordp(@). Then ord,g(p°a) > ord,(a) 
for all primes q, including q = p, so p°a € a. Hence @ € (K/a) [p*], which 
completes the proof that T is surjective. 

(c) This is immediate from (a) and (b). oO 


Let x € Aj be an idele. Recall that the ideal of x is the fractional 


ideal 
(x) = ere. 
p 


If a is any fractional ideal of K, we define ra to be the product (x)a. Using 
the equality (x)p = (x) Rp = tpRp, we see that 


(xa)y = (w)aRy = rpaRy = Xpay. 


Now (8.1c) gives natural isomorphisms 
K/a= @ Kp/ap and K/aa = @ Ky /xpap. 
p p 


We define the multiplication-by-z map on K/a to be multiplication 
of the p-primary component by zy. In other words, multiplication-by-z is 
defined by the commutativity of the following diagram: 


K/a = K/xa 


b I 
D Ky /a > D Kp /zpap 
p p 
(tp) aaa (xptp) 
We also recall from §3 that the reciprocity map for K, 
Ax —> Gal(K®?/K), s+ [s, K], 


is surjective and its kernel contains K*. 

We are now ready to state and prove the main theorem of complex 
multiplication in its adelic formulation. At the risk of making the statement 
overly long, we include a summary of our notation and assumptions. 


Theorem 8.2. (The Main Theorem of Complex Multiplication) Fix the 
following quantities: 


K/Q a quadratic imaginary field with ring of integers Rx, 
E/C an elliptic curve with End(E) & Rx, 
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a € Aut(C), an automorphism of the complex numbers, 
8 € Ax, an idele of K satisfying [s, K] = o| xa». 
Further, fix a complex analytic isomorphism 


f:C/a — E(C), 


where a is a fractional ideal of K. Then there exists a unique complex 
analytic isomorphism 


f':C/s7'a —> E?(C) 
(depending on f and co) so that the following diagram commutes: 


K/a a K/s~*a 


l/r 


E(C) 4 E(C). 


Remark 8.2.1. The statement of Theorem 8.2 remains true for elliptic 
curves whose endomorphism ring is a non-maximal order of kK. Of course, 
one first must explain how to multiply K/a by an idele x when a is an 
arbitrary lattice in K. For details, see Shimura [1] or Lang [1, Ch. 8, 10]. 


Remark 8.2.2. Notice how Theorem 8.2 transforms the algebraic action 
of o on the torsion subgroup f (K/a) = Etors into the analytic action of 
multiplication by s~!: 


FOP af a) fort € K/a and s € AX. 


Compare with (7.1.1). 


PRooF (of Theorem 8.2). Clearly, there is at most one f’, since the 
commutative diagram determines f’ on K/s~‘a, which is a dense subset 
of C/s~'a. 

Suppose that E,/C is an elliptic curve that is isomorphic to E and 
that f; : C/a; — E\(C) is an analytic isomorphism. We are going to 
begin by proving that if Theorem 8.2 is true for (£4, fi), then it is also 
true for (E, f). This will allow us to reduce to the case that E is defined 
over Q(j(E)) and a is an integral ideal. 

So we are assuming that there is an analytic isomorphism 


f:C/s7ta, SS E7(C) 
and a commutative diagram 


K/ay = K/s~'a; 


[1 [f 


E\(C) —> £E7(C). 
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Since E and EF are isomorphic, we may fix an isomorphism 7: E, — E. 
Further, the lattices for EF and FE, are homothetic [AEC VI.4.1.1], so we 
have a, = ya for some y € K*. Then each of the squares in the following 
diagram is commutative: 


K/a => K/s~'a 


h 


K/a, — K/s~‘a 


fh [« 


C) — EC) 


( 
|: |" 


E(C) > EC). 


Ey 


Hence (8.2) is true for (£, f) if we take for f’ the map 
fi:C/sa EC), f(z) = 87 0 fi(y2). 


We are now reduced to proving (8.2) under the assumptions that E 
is defined over Q(j(£)) and that a C Rx is an integral ideal. Fix an 
integer m > 3 and let L/K be a finite Galois extension satisfying 


EVEL and E|[m] c E(L). 


We note that (5.6) implies that L contains K(,,), the ray class field of K 
modulo m. We are going to begin by proving that (8.2) is true on the m- 
torsion points of &. As usual, our main tool will be reduction modulo a 
suitable prime. 


Let $8 be a prime ideal of L satisfying the following five conditions: 


(i) ol, = (RB, L/K); that is, the restriction of o to L is a Frobeniuse- 
lement for $B. 
(ii) p = BOK is a prime of degree 1; that is, p = NOP is a rational 
prime. 
(iii) p is unramified in L. 
(iv) p is not one of the finitely many primes excluded in (4.5). 
(v) $8 does not divide m. 


Such an ideal always exists, since the Tchebotarev Density Theorem (Lang 
[5, Ch. VIII, Thm. 10]) says that there are infinitely many primes satis- 
fying (i) and (ii), whereas each of (iii), (iv), and (v) excludes only finitely 
many primes. 
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Using the fact that DL contains K(j,), we find 
[s, << = Ol seem) since [s, K] = o|xa» by assumption 
= (p, K(m)/K) from (i) and (ii). 


This last map is the Frobenius element associated to p. Let 7 € AX be an 
idele with a uniformizer at the p-component and 1’s elsewhere. Then (3.5c) 
says that [x, K] also equals (p, K(m)/K). Hence [s1~', K] acts trivially 
on K(m), 80 the idelic characterization (3.6) of the ray class field K(m) says 
that the idele s7~! factors as 


st! = au. 
Here a € K™, and for each prime q, u € Aj, satisfies 
Ug € Ry and Uq =1(mod mR,). 
Next we use (5.3) to find an isogeny 


A: E — E? 


whose reduction modulo 8 is the p'*-power Frobenius map. Note that 
since L contains K (j(£)), (i) implies that 


| rej) = (BL/E)| ers) = (p, K(9(£))/K). 
Since E is defined over Q(j(E)), we see that the isogeny described in (5.3) 
is indeed from E to E”. 


We claim that on m-torsion points, \ acts like a. To see this, let T € 
E|m] and use tilde’s to denote reduction modulo $8. Then 


MP) = A(T) = T°, 


since both A and ol, = ($8, L/K) act on the residue field modulo $B as 
the p'*-power map. Now % { m from (v), so [AEC VII.3.1b] says that 
on m-torsion points the reduction map E?[m] — E?|m]| is injective. Hence 


MT) = T? for all T € E[ml. 


In other words, we have produced a commutative diagram 
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(N.B. This diagram describes a deep mathematical relationship, because it 
transforms the algebraic action of o on m-torsion points into the geometric 
action of A. Notice that for general points P € E(K), \(P) and P? will 
not be equal.) 

We also note from (4.3d) that E7 is isomorphic to p x &, where p * E 
is an elliptic curve associated to the lattice p~!a. Using the given analytic 
isomorphism f : C/a — E(C), this means there is an analytic isomor- 
phism f” : C/p~'a — E?(C) so that we have a commutative diagram 


Cy ie 2¢ 

! | 
C/a —> C/pla (*) 
| |r 


E(C) > E(C). 


Recall that we factored the idele s as s = amu. Since every component 
of u is a unit, and every component of 7 is 1 except for a uniformizer in 
the p-component, we have 

(s) = (a)(r) = (a)p, andso s-'a=a ‘pla. 


Thus multiplication by a~! gives an isomorphism 
=9 
C/p-ta > C/s~a, 
so we can extend (*) to form the larger commutative diagram 


G -2&. G2 2 2G 


| | ! 


C/a —> C/p'a — C/s a () 


lr |r lr 


E(C) +> #e7(c) “4 E£-(C). 


Here f’ : C/s~'a + E?(C) is the unique analytic isomorphism making (**) 
commute. We claim that f’ satisfies 


f(t)” Z f'(s_1t) for all t € m~'a/a. 
To verify this claim, we note from above that 


f()” = AF). 
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Combining this with the commutativity of (**), we must check that 
fi(av't) 4 f'(s7) for all t € m~'a/a; 


and since f’ is bijective, this is equivalent to showing that 


2 
att-s testa for all t € m7!a. 


Recalling that s-'t € K/s~‘a is defined by multiplying the q-primary 
component by sg, we must prove for each prime q of K that 


? 
Ot ste sztag for allt €m-"aj. 


Now multiplying through by sq, and using the decomposition sg = 
ATgug from above, we must check that 


? 
Tqugt-tEa,  forallt € m~'a,; 


or equivalently that 
2 
(TqUq — Lag C mag. 


By construction, ug is in Rj and satisfies uy = 1(mod mRq), so we are 
reduced to proving 


2 
(tq — 1)dg C mag. 


There are two cases to consider. First, if q # p, then 7, = 1, so we are 
done. Second, if q = p, we know that mp is a uniformizer, so (7)—1)ap = ap. 
Further, we know from (v) that p {m, so m is a p-adic unit and hence may = 
dy. This proves the desired inclusion for all q, thereby completing the proof 
of our claim. 

To recapitulate, for each integer m > 3 we have produced an analytic 
isomorphism 


fin: C/s-ta > E7(C) 
and a commutative diagram 


24, 
mta/fa *+ m's~ta/s7'a 


[5 ee 
E(C) E?(C). 
To complete the proof of (8.2) it suffices to show that all of the f;, maps 


are the same, since then these commutative diagrams will fit together to 
give the desired result on all of K/a. 
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So let n > 1 be an integer, and let f’,,, : C/s~'a —>+ E°(C) be the 
corresponding analytic isomorphism. Note that the composition f/,,,° (le 
is an automorphism of E’, say f!,,,0 fi,’ = [€] € Aut(E7). Then for 
any t € m~'a/a we have 


ele finls =f. (s 2B) by definition of € 
= f(t)” from construction of f/,,, 


=e ae) from construction of f/,. 
This holds for all t € m~!a, so we conclude that 
(JT =T for all T € E?[m]. 
Since m > 3, this can only happen if [€] = [1], since for [€] 4 [1] the kernel 
of [1 — €] contains at most six points. (If j(£) 4 0,1728, then € = +1, 


and in the two exceptional cases, £4 = 1 or €© = 1. See [AEC III.10.1].) 
Therefore f/,,. = f/,, which concludes the proof of (8.2). oO 


§9. The Associated Grdssencharacter 


In this section we will use the main theorem of complex multiplication 
to define a Grossencharacter associated to an elliptic curve with complex 
multiplication. Recall that a Gréssencharacter on a number field L is a 
continuous homomorphism 


wy: A, —C 


with the property that w(L*) = 1. Our first result describes a map Aj, > 
C* which, with some small modifications, will be the desired Grodssencha- 
racter. 


Theorem 9.1. Let E/L be an elliptic curve with complex multiplication 
by the ring of integers Rx of K, and assume that L D K. Let x € Ay 
be an idele of L, and let s = Nix € Aj. Then there exists a unique a = 
ag/_(£) € K* with the following two properties: 

(i) aR =(s), where (s) C K is the ideal of s. 

(ii) For any fractional ideal a C K and any analytic isomorphism 


f : C/a as E(C), 
the following diagram commutes: 


K/a “=> K/a 


ls | 


E(L) =) EL), 
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Before beginning the proof of (9.1), we should make a few remarks. 
First, from (2.3) we know that K (j(E), Etors) is an abelian extension 
of K(j(£)). Further, since E is defined over L, we know that j(E) € L, 
and so we see that L( Fors) C L*>, This shows that the images of the verti- 
cal maps in (ii) do lie in E(L*>). Second, since K (j(£)) is the Hilbert class 
field of K from (4.3), we know that N/-2 is principal for any ideal 2 of L. 
Since (s) = (Nkx) = N%((z)), we see that there always exists an a € K* 
satisfying (i), and (i) determines a up to a unit of K. It then remains 
for (ii) to pin a down precisely. Third, we note that (i) gives 


as-'a=a(s)~'a=a, 


so the top row of the diagram in (ii) is well-defined. 
PRrooF (of Theorem 9.1). Let 


L' = L{Eters)- 
Since 7(£) € L, it follows from (5.7) and (2.3) respectively that there are 
inclusions 

(iam eae Efcaw Byes 

Choose an automorphism o € Aut(C) such that 

a|pav = [z, [}. 

A standard property of the reciprocity map (3.5b) says that 
alka = ee bllgees = [s, K], 


so applying the main theorem of complex multiplication (8.2), we find an 
analytic isomorphism f’ : C/a — E(C) and a commutative diagram 


K/a = K/s~‘a 


bo dr 

E(C) > E7(C). 
Now E?% = E, since o fixes L. Hence a and s~!a must be homothetic, so 
there is a G € K* such that Gs~ta = a. Our commutative diagram then 


becomes 2h 
K/a pes K/a 


[3 |r 


ECY.. 2S. BO): 
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Note that f” o f~! is an automorphism of E, say f” = [€]o f. If we 
set a = £3 and use the facts that o|za» = [z, L] and Etors C E(L?”), we 
get 


K/a “5  K/a 


| |) 
E(L®) = E(L*), 
which is exactly (ii). Further, we have an equality of ideals 
as ta= $s ta =a, so aRK = (s). 


This proves that a satisfies both (i) and (ii), which completes the proof of 
the existence of part of (9.1). 

Next we check that a is unique. Suppose that a’ € K* also has prop- 
erties (i) and (ii). From (ii) and the fact that f and [z, L] are isomorphisms, 
we get a commutative triangle 


K/a 


K/a Se K/a 


Hence multiplication by a’a~' is the identity map on K/a, so a’ =a. 

Finally we must show that a is independent of the choice of f. Suppose 
that f’ : C/a’ — E(C) is another analytic isomorphism. Then a = ya’ 
for some y € K*, and f’o f~! is an automorphism of EF, so there is a 
unit € € R}, such that f’(z) = f(€yz). Then (ii) for f gives 


f (Hel = f(eqte4) = f(as7téqt) = f’(as7't) for al t € K/a, 


so (ii) remains true if f is replaced by f’. Hence a is independent of f. 
O 


Theorem 9.1 gives us a well-defined map 
agjp: A, — K* CC, 


and it is clear that ag; is a homomorphism. However, it is easy to see 
that ag/,(L*) # 1, so ag, is not a Gréssencharacter. More precisely, 
if 6 € L* and xg € Aj, is the corresponding idele, then [rg, L] = 1. So (9.1) 
says that a = ag/z,(xg) is the unique element of K* such that aRx = 
Nk-((xs))R« = NK(8)Rx and such that multiplication by aN;x;! in- 
duces the identity map on K/a. Clearly, the required a is just NEG. In 
other words, we have proven that 


On/1(tp) = NKB for all 8 € L*, 


which is the first step in proving the following important result. 
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Theorem 9.2. Let E/L be an elliptic curve with complex multiplication 
by the ring of integers Rx of K, assume that L D K, and let ag, : Ai, > 
K* be the map described in (9.1). For any idele s € Aj, let so. € C* 
be the component of s corresponding to the unique archimedean absolute 
value on K. Define a map 


bey: A —C*, bet (&) = agi (2)NK(27")oo. 


(a) Wez is a Grossencharacter of L. 

(b) Let $8 be a prime of L. Then yg 1, is unramified at PB if and only if E 
has good reduction at 8. (Recall that a Grdssencharacter wy : Aj, — C* is 
said to be unramified at $B if b( Ry) = 1.) 


PROOF. (a) It is clear that 7, /z, is a homomorphism. We saw above that 
if 6 ¢ L*, then agyz(rg) = NEB. On the other hand, untwisting the 
definitions we find 


Nk (2a) = Il 8" =NKB. 
TLC 
teed 


Therefore ~p/, (tg) = 1. This holds for all 6, so wg/,(L*) = 1. 

Next we are going to verify that ag/z, : Ay, — C is continuous. Fix an 
integer m > 3. We know from (2.3) that L(E[m]) is a finite abelian exten- 
sion of L. Let By, C Aj, be the open subgroup corresponding to L(E(m]); 
that is, B,, is the subgroup so that the reciprocity map induces an isomor- 
phism 

1/Bm —> Gal(L(E[m])/L) 


wo be cet 
Let 
Wm = {8 € Ak : 8 € RZ and s,=1(modmRp) for all p}, 
and let 
Um = Bm {x € Ay : NKZ E Wm}. 
We note that U,, is an open subgroup Aj. We are going to prove the 


Claim: agy/r(z)=1 for all x € Um. 


Let x € Um, and to ease notation let a = ag/,(x). Also fix an analytic 
isomorphism 


f:C/a > E(C) 
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as in (9.1). Then for any t € m~!'a/a we have f(t) € E[m], so 


f(t) = fpr since z € Bm, so [z, L] fixes L(E[m]) 
=f(aNga7't) from (9.1ii) 
= f(at) since t € m7'a/a and 


(Nkz), € (1+ mR,)/ Rj for all p. 
Hence multiplication by a fixes m~'a/a or, equivalently, 
(a—1)m-laca. 

This inclusion of fractional ideals means that (a — 1)Rx C mMRx, so 

aé Rr and a = 1(mod mRx). 
On the other hand, for any prime p of K we have 
ord, a = ord, (NK2), from (9.1i) 
=1 since the p-component of Nk-x € Wy, is a unit. 


This holds for all p, so a must be a unit, a € R%. But a = 1(mod mRx) 
from above, so the only possibility is a = 1. This proves our claim. 
It follows from the claim and the definition of wz,z, that 


Wei(t)=NK(z)oo for all  € Um. 


From this formula it is clear that wg; is continuous on Up. But Um is 
an open subgroup of Aj. Therefore y,/ 1 is continuous on all of Aj, which 
completes the proof that wz /;, is a Grdéssencharacter. 
(b) Let 1%? C Gal(L*®/L) be the inertia group for B. The reciprocity map 
sends Ry, to IZ, 

[Fis, L] = IEP, 
where we embed Rg, into Az, in the usual way, 


Ry — Aj, utr— |[...,1,1,u,1,1,...]. 


§8 — component 


Let m be an integer with $B {m. We know from (2.3) that E[m] c 
E(L?»), so Ig? will act on E[m]. We want to characterize when this action 
is trivial in terms of values of the Gréssencharacter wz/,,. Thus 


Tg> acts trivially eS 
3B o ab 1 
on: Blea <= fit) f(t) for all o € I and all t € m~‘a/a 
<=> f(t)l™4l = f(t) for all x € Ry and allt € m~‘a/a 

— f(ozt(2)(Nka“!)t) = f(t) 
for all x € R and all t € m~*a/a, 
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where for the last equivalence we have used (9.1). We make two observa- 
tions. First, 

We/L(x) = ag/L(z) for all x € Ry, 
since the archimedean components of x € Ry are all 1. Second, multipli- 


cation by N&-x2~+ induces the identity map on m~!a/a. This follows from 
Lemma 9.3 (see below) and the assumption that 8 fm. Hence we find 


<= f(veyr(z)t) = f(t) 
for all x € Ry, and allt € mt a/a 
<= be/L(2) = 1(mod mRx) 


I acts trivially 
on E|[m| 


for all « € Ri, since f :m7'a/a — E[m]. 


Next we apply the criterion of Néron-Ogg-Shafarevich, which relates 
the action of Ip on E[m] to the reduction of E modulo %. More pre- 
cisely, [AEC VIL.7.1] says that 


13? acts trivially on E[m] for 


<=> E has good reduction at $B. 
infinitely many m prime to P% 1as g eduction at $B 


Combining this with the equivalence proved above, we obtain the desired 
result: 


E has good there are infinitely many m with 8 { m such that 
reduction at pai VpyjL(@) =1(modmRx) for all r € Ry 
<= ve/i(z)=1 forallze Ry 


<=> wer is unramified at $B. 
O 


It remains to prove the elementary result used in the proof of (9.2b). 


Lemma 9.3. Let a be a fractional ideal and 6 an integral ideal of K. 
Let s € Aj, be an idele with the property that 


Sp =1 for all primes p dividing 6. 


Then the multiplication-by-s map s : K/a — K/a induces the identity map 
on b~'a/a. In other words, 


st=t forall t€b'a/a. 


Proor. From (8.1), the decomposition of 6~'a/a into p-primary compo- 
nents is 


b-ta/a —> GB (b-'a/a), = Db; 'a,/a, = Db, *a,/ap. 
P p 


p|b 
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Here the last equality follows from the fact that 6b) = Rp for all p { 6, 
so the only non-zero terms in the direct sum are those with p|b. The 
multiplication-by-s map on b~'a/a is now defined by the commutative 
diagram 

b-ta/a => b~la/a 


[ I 
AD by tap /ap — A by "ap /ay. 


plo plo 
(tp) ea (Sptp) 
But by assumption, s, = 1 for all p|b, so multiplication-by-s is just the 
identity map. Oo 


§10. The L-Series Attached to a CM Elliptic Curve 


The L-series attached to an elliptic curve is an analytic function that is 
used to encode arithmetic information about the curve. One then hopes 
to deduce further arithmetic properties of the elliptic curve by studying 
the analytic properties of its [-series, much as one uses the Riemann zeta 
function to study the set of rational primes. In this section we will define 
the Z-series of an elliptic curve FE and show that if EF has complex multipli- 
cation, then its L-series can be expressed in terms of Hecke L-series with 
Gréssencharacter. 

Let L/Q be a number field, and let E/L be an elliptic curve. For each 
prime $B of LZ, let 

Fa = residue field of L at $, 


ap = NOP = #F y. 
If E has good reduction at $B, we define 
ay = Gp +1— #E(Fy), 
Ly(E/L,T) =1-apT + qpT?. 
The polynomial Lp (£/L,T) is called the local L-series of E at $B. If E has 


bad reduction at $B, we define the local L-series according to the following 
rules: 


1—T if & has split multiplicative reduction at P, 
_ j)1+T° if & has non-split multiplicative 

ERE Y= reduction at $B, 
1 if & has additive reduction at PB. 
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Remark 10.1. In the case that E has good reduction at %, we can give 
a more intrinsic definition of the local L-factor in terms of the action of 
Frobenius on the Tate module. Thus let dy : E = E be the Qxp-power 
Frobenius map, and let 


ope 7 Te(E) Se Te(E) 


be the associated map on the Tate module of E (see [AEC III. §7]), where 
we take some # relatively prime to the characteristic of Fy. If we choose a 
basis for the Tate module, so Te(E) = Ze x Zp, then dp ¢ is represented by 
a 2x 2 matrix with coefficients in Zp. The characteristic polynomial of the 


linear transformation ¢¢ is 
det (1 —- on eT) = (tr op e)T + (det bsp e)T? € Z,(T), 
and this polynomial is independent of the chosen basis for T;(E). 
In fact, [AEC V.2.3] says that the characteristic polynomial of da ¢ 


has coefficients in Z and is independent of ¢. More precisely, we find that 


trope = 1+degd—deg(1—¢) from [AEC V.2.3] 


=1+q—#E(Fx) from [AEC II.2.11c] and [AEC V §1]. 
Similarly, 
det dp,¢=degd from [AEC V.2.3] 
= W from [AEC II.2.11c]. 
Hence 


Ly(E£/L,T) = det(1 — dp eT). 


For a general discussion of this material in terms of -adic cohomology, see 
Hartshorne [1, App. C]. 


We now piece together the local L-factors to form the global L-series 
of E. 


Definition. The (global) L-series of E/L is defined by the Euler product 


L(E/L,s) = || Lp(E/L, 95°)", 
P 


where the product is over all primes of L. 


Using the estimate |am| < 2,/@p from [AEC V.2.4], it is not hard to 
show that the product converges and gives an analytic function for all s 
satisfying Re(s) > 3. Conjecturally, far more is true. 


§10. The L-Series Attached to a CM Elliptic Curve 173 


Conjecture 10.2. Let E/L be an elliptic curve defined over a num- 
ber field. The L-series of E/L has an analytic continuation to the entire 
complex plane and satisfies a functional equation relating its values at s 
and 2 — s. 


We are going to verify this conjecture for elliptic curves with complex 
multiplication by showing that L(E/L,s) is a product of Hecke L-series 
with Gréssencharacter. In general, suppose that 

wy: A; — C 


is a Gréssencharacter on L; that is, 7) is a continuous homomorphism which 
is trivial on L*. Let $8 be a prime of LZ at which w is unramified, so 
~(Ra) = 1. We then define Y($) to be 


p(B) =n pl. bth | 1, an 1, 1, nikal )s 
3B — component 


where 7 is a uniformizer at %. Note that since w is unramified at PB, o(PB) 
is well-defined independent of the choice of 7. For convenience, we also set 


w(B) =0 if w is ramified at B. 
Definition. The Hecke L-series attached to the Grdssencharacter 
y: A, — C* 
is defined by the Euler product 


L(s,b) =] - vi®B)ag") 
BY 


where the product is over all primes of L. 


Hecke L-series with Grdssencharacter have the following important 
properties, whose proof we will omit. 


Theorem 10.3. (Hecke) Let L(s,w) be the Hecke L-series attached to 
the Grdssencharacter w. Then L(s,7) has an analytic continuation to the 
entire complex plane. Further, there is a functional equation relating the 
values of L(s,%) and L(N — s,%) for some real number N = N(q). 


ProoF. This was originally proven by Hecke. It was reformulated and 
reproven by Tate [8] using Fourier analysis on the adele ring A;. o 


The key to expressing L(E'/L, s) in terms of Hecke L-series is to express 
the number of points in E(Fq) in terms of the Gréssencharacter attached 
to E/L. 
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Proposition 10.4. Let E/L be an elliptic curve with complex multipli- 
cation by the ring of integers Rx of K, and assume that L D> K. Let B 
be a prime of L at which E has good reduction, let E be the reduction 
of E modulo $, and let dg : E — E be the associated qsg~power Frobenius 
map. Finally, let ~gj_ : Aj, — C* be the Gréssencharacter (9.2) attached 
to E/L. Then the following diagram commutes: 


(ve/1(B)] 
E ——— E 
| | 
Be Ses, 


where the vertical maps are reduction modulo {. 


Proor. Before we begin the proof, two remarks are in order. First, ¢/1, 
is unramified at $B from (9.2b), so Wz/z,(PB) is well-defined. Second, since 
wet (PB) is the value of gz, at an idele with 1’s in its archimedean com- 
ponents, we have g/_(8) = az_(B) € Rx, so it makes sense to talk 
about [we / 1(B)] as an endomorphism of E. 

Let x € Aj, be an idele with a uniformizer in its $-component and 1’s 
elsewhere. Then as we just remarked, 


Wey/i(B) = vest(z) = any1(x) € Re. 


The commutative diagram (9.1) used to define ag, tells us that 


f(t) 4) = [wey(x)|f (Nka7't) for allt € K/a. 


Fix some integer m with % { m. Then (9.3) says that Nka2~!t = ¢t for 
all t € m~!a/a, so we get 


f(t) 4) = [deyr(a)| f(t) for all t € m~1a/a. 


Now consider what happens when we reduce modulo 8. We have [z, L] = 
(98, L?/L) from (3.5), so [z, L] reduces to the gq-power Frobenius map. 
Hence 


én (7) = FO = [beyr(a)|F) for all temo a/a. 


Since this is true for all m prime to 8, and since an endomorphism of E 
is determined by its effect on torsion (or even on é-primary torsion for a 
fixed prime £ [AEC III.7.4]), we conclude that 


op = [vx/1(2)]- 


§10. The L-Series Attached to a CM Elliptic Curve 


Corollary 10.4.1. With notation as in (10.4), we have 


(a) ap = NOP = NO (vz/1(P)); 
(b) #E(Fp) = NOB +1 -— vey. (P) — vey .(B): 
(c) ag = Ve/L(P) + vet (FP). 


(The bar indicates complex conjugation of elements of K.) 


PROOF. (a) We compute 
NOP = deg dp from [AEC II.2.11c] 


= deg [bey (B)| from (10.4) 
= deg[¥z/1(B)| from (4.4) 
=N&(ve/(8)) from (1.5). 


(b) Similarly, we compute 
#E(Fp) = #ker(1 — dp) 


= deg(1 — dx) from [AEC III.5.5] 
and [AEC III.4.10c] 


deg [1 —be/1(B)| from (10.4) 
deg [1 — vex (B)| from (4.4) 
@ 
1 


N& (1 — ve/1(B)) _ from (1.5) 


( 
1 


— Pe/t(B)) (1 — vet (®)) 
— bei (P) — vet (P) +NGB from (a). 


(c) Immediate from (a), (b), and the definition of ag. 


175 


QO 


We now have all of the tools needed to relate the L-series of E to 
the L-series attached to its Grdéssencharacter, at least in the case that the 
field of definition of FE contains the CM field. We will leave the proof in 


the other case to the reader. 


Theorem 10.5. (Deuring) Let E/L be an elliptic curve with complex 


multiplication by the ring of integers Rx of K. 


(a) Assume that K is contained in L. Let Wg, : Ay, — C* be the Grés- 


sencharacter (9.2) attached to E/L. Then 


L(E/L,s) = L(s, vei) L(s, Pez): 
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(b) Suppose that K is not contained in L, and let L’ = LK. Further 
let Weji : Ay, — C* be the Grdssencharacter attached to E/L’. Then 


L(E/L,s) = L(s,%p/1’)- 


Using Hecke’s theorem (10.3), we immediately deduce that the L-series 
of a CM elliptic curve has an analytic continuation and satisfies a functional 
equation. A more careful analysis yields the following result. We will leave 
the proof to the reader. 


Corollary 10.5.1. Let E/L be an elliptic curve with complex multiplica- 
tion by the ring of integers Rx of K. The L-series of FE admits an analytic 
continuation to the entire complex plane and satisfies a functional equation 
relating its values at s and 2 — s. 

More precisely, define a function A(E/L, s) as follows: 
(i) IfK CL, let 


A(B/L, 8) = (N&(Dr/aew))* ((2m)~*T(8)) I L(E/L, 8), 


where cy, is the conductor of the Grdssencharacter Wz/1, D1/g is the dif- 
ferent of L/Q, and I(s) = fe t®—le~* dt is the usual T-function. 
(ii) If kK ¢ L, let L' = LK and 


A(E/L, 8) = (N5i (D1jac),))/” (2m) "1 (s)) “I L(B/L, 8); 


where cy is the conductor of the Grdssencharacter fr’. 
Then A satisfies the functional equation 


A(E/L, s) = wA(E/L,2—s), 
where the quantity w = wx, € {+1} is called the sign of the functional 
equation of E/L. 


ProoF (of Theorem 10.5). We know from (6.1) and [AEC VIL.5.5] that E 
has potential good reduction at every prime of L, so [AEC VII.5.4(b)] tells 
us that & has no multiplicative reduction. Hence 


_ fl—apT + pT? if E has good reduction at P, 
Pete et) { i if E has bad reduction at P. 


Now suppose £ has good reduction at $8. Then 


Dg(E/L,T) =1—agyT + qpT? by definition of Lx, 


=1- (dei (P) + ve/r(B))T + (NG Vey (P))T? 
from (10.4.1) 


= (1 — vei (B)T) (1 — Ye BT). 
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On the other hand, (9.2b) says that ~g,z is unramified at P if and only 
if E has good reduction at 5B, and the same is true for {g/_. Thus 


be L(B) = ve (BP) =0 if E has bad reduction at , 


so the formula given above for Ly(E£/L,T) is also true for primes of bad 
reduction, since it reduces to Ly(E/L,T) = 1. Therefore 


L(B/L, 8) = ]] Lp B/L, 45°) * 
Bo 
= ute = be/t(B)ay°) (1 a Da Bay) 
Zp 


= L(s, be/1)L(s, be/z)- 
(b) See exercises 2.30, 2.31, and 2.32. Oo 


Example 10.6. Let D € Z be a non-zero integer, and let E be the elliptic 
curve 
E:y=224+D 


having complex multiplication by the ring of integers Rx of the field K = 
Q(V—3). Let p be a prime of Rx with p { 6D. Since Rx is a PID, we 
can write p = (7), and one can check that there is a unique 7 generating p 
which satisfies 7 = 2(mod 3). It is then a moderately difficult exercise 
using Jacobi sums (see Ireland-Rosen [1, 18 §§5,7|) to show that 


Tv as 


#E(Fp) =N6pt+1+ (=) m+ (2) i, 
6 6 


where (£), is the 6''-power residue symbol; that is, (2), is the 6''-root of 


us us 


unity satisfying 
(Ng P-1)/6 = (2) (mod 7). 
TI6 


Using (10.4.1), we see that the Gréssencharacter attached to E is given 
either by 


4D 
Ve/K(p) = — (2) nm  orelseby = We/xK(p) = — (2) 7. 
6 T /¢ 


To determine which one it is, we use (5.4) to find a root of unity € € 
Rix such that the reduction of [€7] modulo p is NG p-power Frobenius. 
Note that (5.4) says this is possible for almost all degree 1 primes of K. 


178 II. Complex Multiplication 


But (10.4) says that [v~/%(p)] also reduces to Frobenius. We conclude 
that 


4D 
WesK(p) = -(=) Tr, where p = (7) and m = 2(mod 3), 
6 


at least for almost all degree 1 primes p of K. By the continuity of yw and 
the reciprocity law for (ps we see that this formula holds for all p. 

Using (10.5) and NOt = ni, we find that the L-series of E over K 
and over Q can be written explicitly using residue symbols as 


L(E/K.s)= [I (1+ (2)n) 


nERK prime 
m=2 (mod 3) 


L(E/Qs)= JJ (1+ (Zl) 


awERK prime 
mw=2 (mod 3) 


EXERCISES 


2.1. Let K/Q be a quadratic field with ring of integers Rx, and let R C K be 
an order in K. Prove that there is a unique integer f > 1 such that 


R=Z4+f-RxK. 


The integer f is called conductor of the order R. 
2.2. “Let A = Z[i] be the lattice of Gaussian integers, and let » be the elliptic 


integral 
_ / hae 
o Vi-t 
(a) Prove that 
go(A) = 64\*. 


(b) More generally, prove that for all integers n > 1, Gan(A) is a rational 
number multiplied by \*”. 

2.3. Let K/Q be a quadratic imaginary field, and let E'/C be an elliptic curve 
with End(E) @ Q& K. Let E£’/C be another elliptic curve. Prove that E’ 
is isogenous to F if and only if End(B’) @Q= K. 
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2.4. 


2.5. 


2.6. 


2.7. 


2.8. 


2.9. 


Let E be an elliptic curve defined over a number field Z with complex 
multiplication by K, and let % be a prime of L of characteristic p at which E 
has good ordinary reduction. Prove that Q, contains a subfield isomorphic 
to Kk. 

Let E/Q be an elliptic curve with complex multiplication by the ring of 
integers in Q (/-7 iF Without using an explicit Weierstrass equation for E, 
prove the following two facts: 

(a) If @: E — E is an endomorphism of degree 2 and P is the non-zero 
point in the kernel of ¢, then P ¢ E(Q) and PE E (Q (/-7)). 

(b) E(Q) contains exactly one point of order 2. 

(Hint. Use (2.2a).) 

(a) Let F be a field, let G be a subgroup of GL2(F), and let C(G) be the 
centralizer of G; that is, 


C(G) = {a € Gla(F) : ay = ya for all y € Gh. 


Prove that one of the following two conditions is true. 


(i) c@={ (4 - cer} 


(ii) G is abelian. 
(Hint. If C(G) contains a non-scalar matrix a, make a change of basis to 
put a into Jordan normal form and then calculate C(a).) 
(b) Let L be a perfect field, let E/L be an elliptic curve, and let @ be a 
prime with € 4 char(L). Suppose that End ,(£) is strictly larger than Z. 
Use (a) to prove that the action of Gal(Z/L) on the Tate module Ty(E) is 
abelian. 
The following fact (2.5.1) from commutative algebra was used in the proof 
of Proposition 2.4. Let R be a Dedekind domain, let a be a fractional ideal 
of R, and let M be a torsion-free R-module. Prove that the natural map 


@: a tM — + Homar(a,M) 
£ r> (bdr: a az) 


is an isomorphism. (Hint. Prove that it is an isomorphism after you localize 
at any prime p of R. Note that the localization R, is a principal ideal 
domain.) 

Let L/K be a finite abelian extension of number fields. 

(a) Let 2 be a non-zero fractional ideal of L. Prove that 


(NK&, L/K) = 1. 


(b) Prove that an unramified prime p of K splits completely in L if and 
only if (p,L/K) = 1. 
Let a be a fractional ideal of K, and let E be an elliptic curve corresponding 
to the lattice a. 
(a) Prove that j(£) € R if and only if a? = 1 in CL(Rx). 
(b) Prove that the following are equivalent: 

(i) Q(j(E)) is Galois over Q. 

(ii) Q(j(£)) is totally real. 

(iii) Every element of CL(Rx) has order 2. 
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2.10. 


2.11. 


2.12. 


2.13. 


2.14. 


2.15. 


II. Complex Multiplication 


Let K be a number field, Rx its ring of integers, and ¢ an integral ideal 
of Rx. Prove that there is an exact sequence 


x  (RK/)” - I(c)/P() -— CL(RK) > 1. 
a(modc) + (a) 


Let K =Q (V=15 ), and let Rx be the ring of integers of K. 
(a) Prove that CL(Rx) = Z/2Z. 


(b) Let L=Q (v —3, V5 )- Prove that L/K is everywhere unramified, and 
deduce that L is the Hilbert class field H of K. 


(c) Let K =Q (V=23). Prove that the Hilbert class field of K is given 
by H = K(a), where a satisfies a? —a —1=0. 

Let E/C be an elliptic curve such that End(£) is an order in the quadratic 
imaginary field K. 


(a) Show that there exists an elliptic curve E’/C and an isogeny ¢: E — E’ 
such that End(E’) = Rx. 


(b) “If E is defined over the field L, prove that it is possible to choose E’ 
and ¢ in (a) so that both are defined over L. 

Let K = Q(t) and let Kn be the ray class field of K modulo N. 

(a) Prove that 


Ko=K, K3=K(V3), Ka =K(vV2). 


(b) Verify directly that the field 


(= 


is an abelian extension of K, and compute its Galois group over K. 
(This exercise verifies some of the statements made in (4.9.1).) 


Let E be the elliptic curve y? = 2* + 1, and let K = Q (V=3). For each 
integer N > 1, let Ky = K(h(E[N])) and Ly = K(E[n)). 

(a) Calculate K2, K3, and Ka, explicitly and in each case verify that Kn 
is the ray class field of K modulo N. 

(b) Calculate Le, L3, and L4 explicitly and show that they are abelian 
extensions of K. 


Let E be the elliptic curve E : y? = 2° + 42 + 2z, and let K =Q (V—2). 
From (2.3), this curve has complex multiplication by K. Redo the previous 
exercise for this curve E and field K. 


‘, 
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2.16. 


2.17. 


2.18. 


For each of the quadratic imaginary fields in the following table, verify that 
the given a generates the Hilbert class field of K, and calculate the value 
of j(Rx) explicitly as an element of K(a). (We have filled in the first row 
for you, see Example 6.2.2.) 


Let D7, and 87, be the sets of matrices defined by 


Di = AG ‘) € M2(Z) : ad— bc =n, gcd(a,b,c, d) = i\, 


a= {(5 )) ED, d>0,0<b<ah. 


(a) Prove that the natural map 8% —> SL2(Z)\Dj, is a bijection. 


(b) Prove that 
#8; =» TT (04 2). 


pin 
(Notice that if n is squarefree, then D7, and 8}, are just the sets D, and 8, 
considered in 86.) 
Let 87, be as in the previous exercise, and define 


X)= [] * -soa). 


aesr 


®,, is called the modular polynomial of order n. 

(a) Prove that ®, € Z[j][X]. We will write ®,(j, X) to indicate that ®, 
is a polynomial in two variables. 

(b) Prove that ®, is irreducible over C(j). 

(c) Prove that ®,(Y, X) = ®n(X,Y). 

(d) Prove that if n is not a perfect square, then ®,(X, X) is a non-constant 
polynomial with leading coefficient +1. 

(e) Let Fn(j,X) be the polynomial from (6.3). Prove that 


Fr(3,X)= J] ®nya2(9,X). 
d2|n,d>1 
In particular, if n is square-free, then F, = ®,. 
(f) *Let |®,,| denote the magnitude of the largest coefficient of ®, (Y, X). 


Prove that 
log |®n| 


ee (deg ®n) (log n) = 


182 


2.19. 


2.20. 


2.21. 


2.22. 


2.23. 


II. Complex Multiplication 


Let Fn(Y, X) be the polynomial from (6.3), and let ®,(Y, X) be the poly- 
nomial from the previous exercise. Let £,/C and F2/C be elliptic curves. 
(a) Prove that F,,(j(£1),j(E£2)) = 0 if and only if there is an isogeny 
E, — Ex of degree n. 

(b) Prove that ®,(j(£1),j(£2)) = 0 if and only if there is an isogeny 
E, — Ez whose kernel is cyclic of degree n. 


Let p be a prime. Prove Kronecker’s congruence relation 
F,(Y, X) = (X — Y?)(X? —Y) (mod pZ[X,Y]), 


where Fy, is the polynomial defined in (6.3). 


Let f(r) be a modular function of weight 0 that is holomorphic on H and 
that has the g-expansion f = )*ang”. Let R be a ring containing all of 
the an’s. Prove that f € Rij], where j = j(7T) is the modular j-function. 
This strengthens (1.4.2b), which says that f € C[j]. 


Let > 3 be a prime, let 


relerie a ee {Ae GLa(Ze): A= € :) (add o. 


and let M2(Zc) be the additive group of 2x 2 matrices with €Z¢-coefficients. 
Prove that the map 


pS. om n+lpn an 
GLa(Ze)1 — Mo(lZe), 1+ €Ar—+ log(1 + £4) = 5° a ae ee 
n=1 


is a well-defined isomorphism. 


This exercise generalizes the previous one. Let L be a finite extension of Qs, 
let R be the ring of integers of L, and let IN be the maximal ideal of R. 
For each integer r > 1 define a subgroup G, of GLn(R) by 


G, = {A€GL,(R) : A= In (mod M")}, 


where I, is the n x n identity matrix. 
(a) Prove that for every r > 1, the quotient G,/G,+1 is a finite group 
whose order is a power of @. 
(b) Prove that 
G, = lim Gi/G,, 


and deduce that G is a pro-é group. 

(c) Prove that if r is sufficiently large, then G,. is isomorphic to the additive 
group of n X n matrices with coefficients in R. (Hint. For the case n = 1, 
see [AEC IV.6.4b].) 
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2.24. 


2.25. 


2.26. 


2.27. 


Let FE and E’ be elliptic curves given by Weierstrass equations 
B:y? =2* + acx* 4+ asx + a6, EB: Y¥? = X34 aX? 4+a,X +45, 
and let ¢: E — E’ be a non-zero isogeny. 


(a) Prove that there is a rational function R(x) and a constant c such 
that ¢ has the form 


ole,y) = (202), v2) ) 


(Hint. Look at the invariant differentials.) 
(b) Prove that there is a commutative diagram 


C/A — C/A’ 


Zrrc 1 z 


| | 


EC) - BC) 


where the vertical maps are complex analytic isomorphisms and c is the 
constant from part (a). 

Let E/L be an elliptic curve defined over a number field L with com- 
plex multiplication by the ring of integers of K, and assume that K C L. 
Let x € H'(Gz/,,Aut(E)), and let EX/L be the corresponding twist 
of E. (See [AEC X §5] for basic facts about twists.) If we identify Aut(E) 
with fn [AEC III.10.2], then x gives a homomorphism 


x: Gal(L*?/L) — pn CC’, 
and we can extend x to a homomorphism on the ideles by the rule 


Prove that 
Wex/L =X WE/L- 


Let E/L be an elliptic curve defined over a number field (not necessar- 
ily with complex multiplication). Prove that the infinite product defin- 
ing the L-series L(E/L,s) converges absolutely and uniformly in the half- 
plane Re(s) > 3. 

Let E/L be an elliptic curve defined over a number field (not necessarily 
with complex multiplication), let $8 be a prime of L at which E has good 
reduction, and let Fy be the residue field at $8. For each integer n > 1, 
let Fan be the extension of Fy of degree n. Recall [AEC V §2] that the 
zeta function of E/F is the formal power series 


Z(E/By,T) = exp(S#E (Pan) ). 


n=1 
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2.28. 


2.29. 


2.30. 


II. Complex Multiplication 


(a) Prove that 
Ly(E/Fx,T) 
(1—T)Q— pT)’ 


where Ly is the local L-series described in §10 and gg = No. 
(b) The zeta function of the field L is given by the usual Euler product 


¢u(s) = [J a9), 


-p 
and the (global) zeta function of E'/L is defined by the product 


¢(E/L, 8) = || 2(E/Fp, 5°). 
By 


Z(E/Fp,T) = 


Find the “correct” definition for the factor Z(E/F,T) in the case that E 
has bad reduction at $8, and prove that 


¢(E/L, 8) = 6x (s)¢n(s — 1) L(E/L, 8)". 
With notation as in §10, prove that 


_ Ens(F 
Lp (E/Fx,qy') a ee). 


Here Ens is the non-singular part of E. Note that we do not assume E has 
good reduction at %. (See [AEC HI §2] and [AEC exer. 3.5].) 

Prove the functional equation (10.5.1) for the L-series of an elliptic curve 
with complex multiplication. (Hint. Use the functional equation for Hecke 
L-series with Gréssencharacter as described, for example, in Tate [(8].) 

In the next three exercises we sketch the proof of Theorem 10.5(b). We 
set the following notation: Let E/L be an elliptic curve with complex 
multiplication by the ring of integers Rx of K, and assume that L does 
not contain K. Let L’ = LK, so L’ is a quadratic extension of L, and let B 
be a prime of L. From (9.2) there is a Gréssencharacter wmz : Ay, — C*. 
Let gp, ag,...be the quantities described in §10. 

Assume that E has good reduction at $B. 

(a) Prove that % is unramified in L’. 

(b) Suppose $ splits in L’ as PR,» = Y'P”". Prove that 


Ip = Up = 4p" and ag = Weyi (P') + dee (P"). 
(c) Suppose 8 remains inert in L’, say PRz = YP’. Prove that 
=I, ap =O, Yeu (P') = —ay. 


(d) Let E be the reduction of E modulo $B, and let p be the residue 
characteristic of $B. Prove that 
Bi ordinary if B splits in L’ and p splits in K, 
is 
supersingular if $8 is inert in L’ and p does not split in K. 
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2.31. 


2.32. 


2.33. 


2.34. 


We continue with the notation and assumptions from above. Let $s’ be a 
prime of L’ lying over }. 

(a) If $$ ramifies in L’, prove that E has bad reduction at $. 

(b) If $$ is unramified in L’, prove that 


E has good reduction at $ <=> E has good reduction at $’. 


We continue with the notation and assumptions from above. 
(a) Prove that the local L-series of E at PB is given by 


(1 = deye (PT) — bese (PT) 

if BR, = P'S" splits in L’, 
l—wee(P)T if PRy: =P’ is inert in L’, 
1 if PR, =? ramifies in L’. 


LIy(£/L,T) = 


(b) Prove that the global L-series of E/L is given by 
L(E/L,s) = L(s, bez’). 
Fix a non-zero integer D € Z and let E be the elliptic curve 
E:y’=2°— Dz. 


Let p € Z be a prime with p{ 2D. 
(a) If p = 3 (mod 4), prove that 


#E(Fp)=pt1 and #E(F,2) =(p+1)?. 


(b) If p = 1(mod 4), factor p in Z[t] as 


p=TT with m=1 (mod 2+ 2%). 


#E(Fp)=pt1 (?) rg (2) i, 


where (¢), is the 4**-power residue symbol. 


Prove that 


Continuing with the notation from the previous exercise, let p C Z[i] be a 
prime ideal with p { 2D. Write 


p = (z) for an element 7 € Z[i] satisfying 7 = 1 (mod 2 + 22). 


Prove that the Grdssencharacter associated to E/Q(i) is given explicitly 
by the formula 


D 
Ve/ou)(p) = (2) TT. 
4 


Here z/Q(:)(p) equals the value of ~ at an idele with a uniformizer at 
the p-component and 1’s elsewhere. 
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2.35. Let E/L be an elliptic curve with complex multiplication by the ring of 
integers Rx of K, and assume that K C L. Let p be a degree | prime 
of K, let p = NOP; and let Ry be the completion of Rx at p. Notice 


that Ry = Zp. 

Fix an integer h > 1 so that p” is principal, say p” = Rx. We make 
the collection of groups E[p”], n = 1,2,..., into an inverse system using 
the maps 


[x] 
Elp"t") —> Elp"], = n=1,2,.... 
The p-adic Tate module of E is then defined to be the inverse limit 


T,(E) = lim Ep"). 


(a) Prove that T,(£) is a free Ry-module of rank 1. Deduce that Aut Tp(£) 
is isomorphic to Rp = Zp. 

(b) Let Ly be the compositum of the fields L(E[p”]) for all n > 1 or, 
equivalently, the field defined by 


Gal(L/Ly) = ker{Gal(L/L) —> Aut T,(E)}. 
Prove that there is a finite extension L’/L contained in Ly such that 


Gal(Ly/L’) & Zp. 


c) Let [ = Z,, and define a ring Z,|I'] to be the inverse limit 
P p 


def 


Zp[C] “ limZ,[0/p"T]. 


(Note that Z,[I'] is not the same as the group ring Z,[I'].) Prove that 
Rp[Gal(L,/L’)] = Zp[0], 


and hence that T,(£) is a Z,[[]-module. 

(d) Prove that Z,[I’] is isomorphic to the power series ring Zp[T]. (Hint. 
Let y € I be a topological generator, send y” to (1+ 7)", and show that 
this extends to an isomorphism.) Zp[I']-modules are often called Iwasawa 


modules. 
(e) Let L, be the field of definition of T,(£) = lim E[p”]. Prove that there 
is a finite extension L’’/L contained in Ly such that 


Gal(Lp/L") & Zp x Zp. 


(Hint. Write pRx = pp’. Show that Lp = LpLy, and that Lp M Ly is a 
finite extension of L.) 


CHAPTER III 


Elliptic Surfaces 


Elliptic surfaces appear in many guises. They are one-parameter algebraic 
families of elliptic curves, they are algebraic surfaces containing a pencil of 
elliptic curves, and they are elliptic curves over one-dimensional function 
fields. In this chapter we will see elliptic surfaces arising in all of these 
ways. Since our emphasis in this book is primarily on arithmetic questions, 
we will concentrate on those properties of elliptic surfaces which resemble 
the arithmetic properties of elliptic curves defined over number fields. This 
means we will neglect many of the fascinating geometric questions raised by 
the study of elliptic surfaces over algebraically closed fields, especially the 
classical theory of elliptic surfaces defined over C. The interested reader will 
find a nice introduction to this material in Beauville [1], Griffiths-Harris [1, 
Ch. 4, §5] and Miranda [1]. 

We will also find it necessary to restrict attention to fields of character- 
istic zero. We do this in order to apply the results from [AEC], especially 
Chapters I, IT, and III, to elliptic curves defined over the function field k(C) 
of a projective curve C/k. All of the main theorems in [AEC] were proven 
for elliptic curves F'/K under the assumption that the field K is perfect; if k 
has characteristic p > 0, then the field k(C) will certainly not be perfect. 

However, we must in all honesty point out that elliptic curves defined 
over non-perfect fields such as F,(T’) are also extensively studied. (Equiv- 
alently, one would study elliptic surfaces € — P! defined over F,.) In fact, 
since the rings F,[Z] and Z share the property that all of their residue 
fields are finite, elliptic curves defined over F,(T) behave arithmetically 
very much like elliptic curves defined over Q. Thus conjectures about el- 
liptic curves defined over Q are often first tested and proven in the easier 
setting of elliptic curves over F,(T). Notice that k(T) does not have this 
property if char(&) = 0, so we will find that the theory of elliptic surfaces 
in characteristic zero differs in some respects from the theory of elliptic 
curves over number fields. 

The main results proven in this chapter are the Mordell-Weil theo- 
rem for elliptic surfaces (6.1), two constructions of the canonical height 
pairing (4.3, 9.3), and specialization theorems for the canonical height 
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(11.1, 11.3.1) and for the homomorphism from sections to points on fibers 
(11.4). Unfortunately, it will not be possible to prove all of the background 
results we need from algebraic geometry. However, we will give a pre- 
cise statement of the results we use and give at least some indication of 
the proofs. Briefly, we will use abelian varieties and Jacobian varieties in 
§2, rational maps between varieties in §3, intersection theory and minimal 
models of surfaces in §§7 and 8, and divisors on varieties in §10. Much of 
the material we need is contained in Hartshorne [1], but in any case, we 
will give references for all assertions that we do not prove. 


§1. Elliptic Curves over Function Fields 


One way to define an elliptic surface is as a one-parameter algebraic family 
of elliptic curves. In this guise we have already seen numerous examples 
of elliptic surfaces. For example, during the proof of [AEC III.1.4(c)] we 
wrote down the elliptic curve 


36 1 


E:y? = 3 — = 
BE Se ape. a, SOTO 


with j-invariant jo. In reality, E is a family of elliptic curves, one for 
each choice of the parameter jo (except that F is singular or non-existent 
for jp = 0 and jo = 1728). Similarly, in [AEC IX §7] and [AEC X §6] we 
looked at the elliptic curves 


yi=2°+D and yi =2°+ Dz 


for varying values of D. Again these are families of elliptic curves, in this 
case parametrized by D, and each value of D other than D = 0 gives an 
elliptic curve. 

More generally, if & is any field (of characteristic not equal to 2) and 
if A(T), B(T) € k(T) are rational functions of the parameter T, then we 
can look at the family of elliptic curves 


Er :y*? = 2° + A(T)z + B(T). 
For most values of t € k we can substitute T’ = t to get an elliptic curve 
E,:y? =2° + A(t)e + Bit). 
Precisely, FE, will be an elliptic curve provided 
A(t)#oo, B(t)#o, and A(t) = —16(4A(t)? + 27B(t)?) 40. 


Later in this chapter we will pursue further this idea of an algebraic 
family of elliptic curves. But for now we want to alter our perspective a 
bit. Rather than considering the equation 


E:y=2°+A(T)z+B(T) 
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as defining a family of elliptic curves, one for each value of T, we will 
instead view FE as a single elliptic curve defined over the field k(T). As 
long as the discriminant 


A(T) = -16(4A(T)? + 27B(T)?) 40 in k(T), 


E will be an elliptic curve defined over the field k(T). So we will be able 
to apply much of the general theory developed in [AEC] to the elliptic 
curve E'/k(T), at least provided that the field k(T) is perfect. For this 
reason we will henceforth make the assumption: 


(k is a field of characteristic zero. | 


(For further comments about this assumption, see the introduction to this 
chapter.) 


Example 1.1.1. Consider the elliptic curve E/Q(T) given by the Weier- 
strass equation 
E:y=2-T’x+T? 


with discriminant 
A = 16T*(4T? — 27). 
This curve has the rational point 
P =(f,T) € E(Q(T)), 

and one can easily use the addition law to compute 

IPP = ar SP) 

3P = (at a ee 

(LT — 3)? (F3)? 
If we substitute T = t for some t € Q, then we will obtain an elliptic 
curve FE, unless t = 0 or t = +33. 
Example 1.1.2. The elliptic curve 
E:ys2°+(T?-1)24+T? 

has many rational points defined over Q(T), such as the point (—1,0) of 


order 2 and the point (0,7) of infinite order. However, if we replace Q 
by Q (v2 be we find a new point of infinite order: 


(1, V2) € E(Q(v2)(T)). 
In general, if E/Q(T) is a non-constant elliptic curve (i.e., j(E) ¢ Q), then 
there exists a finite extension k/Q so that 
E(k(T)) = E(C(T)). 
In practice it is often difficult to find k. See Kuwata [1,2] and exercise 3.17. 
Shioda [1] has constructed an interesting example for which the action 


of Gal(k/Q) on E(k(T)) is a representation of type Eg, so [k : Q] may be 
quite large. 
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We have been taking the coefficients A and B of the elliptic curve 
E:y=2°+Ar+B 


to lie in the field of rational functions k(T). This is unnecessarily restrictive. 
We observe that k(T) is the function field of the projective line P!, so we 
might consider choosing A and B from the function field of some other 
curve. Thus we can take a non-singular projective curve C/k and look at 
elliptic curves F defined over the field k(C). 


Example 1.1.3. Let C’/Q be the (elliptic) curve 
C2852 at: 


that is, C is the projective curve corresponding to this affine equation. 
Then the equation 


E:y?+(st+t—s?)ry + s(s —1)(s —t)t?y = 2? + s(s — 1)(s — t)tx? 


defines an elliptic curve & over the function field Q(C) of C. Notice that E 
contains the rational point 


P = (0,0) € E(Q(C)). 


It is not hard to verify (at least if you have access to a computer with a 
symbolic processor) that 


[11JP =O, 


so P is a point of order 11. 

In fact, E is in some sense the universal family of elliptic curves con- 
taining a point of order 11. This means the following: Let A be any 
elliptic curve and Q € A a point of order 11. Then there is a unique 
point (so,to) € C such that if we substitute (s,t) = (so, to) into the equa- 
tions for E and P, we will obtain an elliptic curve Eo and a point Po € Eo 
of order 11 such that there is an isomorphism ¢: A — Eo with ¢(Q) = Po. 
In the literature, the curve C is called the modular curve X(11), and FE is 
usually denoted £)(11). For further details, see exercise 3.2. 
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§2. The Weak Mordell-Weil Theorem 


Our task in this section is to prove the following weak Mordell-Weil theorem 
for elliptic curves defined over function fields. We emphasize again our 
assumption that the constant field k always has characteristic 0. 


Theorem 2.1. (Weak Mordell-Weil Theorem) Let k be an algebraically 
closed field, let K = k(C) be the function field of a curve, and let E/K be 
an elliptic curve. Then the quotient group E(K)/2E(K) is finite. 


The theory of function fields of curves is analogous to the theory of 
number fields, and the proof of the weak Mordell-Weil theorem is similar 
in both cases. In other words, we could save space here by merely quoting 
the proof given in [AEC, VIII §1] with the words “number field” replaced 
by the words “function field.” However, there are enough differences that 
we feel it is worthwhile giving the proof. We will place our main emphasis 
on highlighting the differences between the two cases. 

Recall that the proof for number fields has two main steps. The first 
step [AEC, VIII.1.5] depends on properties of the elliptic curve E/K. It 
says that the extension field L = K([m|~'E(K)) is an abelian extension 
of K, has exponent m, and is unramified outside a certain finite set of 
primes S. This step carries over word-for-word to the function field case 
once one has developed the theory of valuations. We will give a slight 
variant of the argument in [AEC, VIII §1], using divisors supported on a 
finite set of points, but the reader will have no trouble seeing that this is 
merely a matter of using geometric language to deal with the same ideas. 
We will also consider only the case m = 2, since this allows us to work 
more directly with the equation for F/K. 

The second step [AEC, VIII.1.6] has nothing to do with elliptic curves. 
Instead, one uses Kummer theory to show that the maximal abelian ex- 
tension of K of exponent m unramified outside of S is a finite extension. 
The proof of this proposition is not hard, but it ultimately relies upon the 
two fundamental finiteness theorems of algebraic number theory, namely 
the finiteness of the class group and the finite generation of the unit group. 
In general, neither of these last two results is true for function fields. 

For example, if K = k(C) is a function field, then the “unit group” 
in K* is the constant field k*. To see that this is the right analogy, note 
that for a number field A, the unit group can be described as the set of 
elements a € K* satisfying v(a@) = 0 for all discrete valuations on K*. 
But the discrete valuations on a function field K = k(C) correspond to the 
points of C(k) (at least if k is algebraically closed). Thus if a function f € K 
has valuation 0 for all valuations, then it has no zeros or poles on C, so 
by [AEC, II.1.2] it is a constant. Hence the “unit group” of K will be k*, 
and in general k* will not be finitely generated. 

Similarly, we will see during the proof of the weak Mordell-Weil theo- 
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rem that the “ideal class group” of a function field K = k(C) is the Picard 
group Pic(C), that is, the group of divisors modulo linear equivalence. The 
Picard group need not be finitely generated; see (2.6) below. 

However, all is not lost. A closer examination of [AEC, VIII.1.6] shows 
that it does not require the full strength of the finiteness theorems. In- 
stead we used the facts that the ideal class group has only finitely many 
elements of order m and the unit group R* has the property that the quo- 
tient R*/R*™ is finite. These weaker results are true for function fields 
under appropriate assumptions on the constant field k of K. For exam- 
ple, if k is algebraically closed, then k*/k*”™ is certainly finite, since it is 
actually trivial. 

Similarly, the Picard group Pic(C) has only finitely many elements of 
order m. Unfortunately, the proof of this last statement requires results 
from the theory of Jacobians and abelian varieties which we will not be 
able to develop in full. So we will just state here the proposition that we 
need and postpone until the end of the section a sketch of the proof. For 
the proof of the weak Mordell-Weil theorem (2.1), we will only need to use 
the m = 2 case of the following proposition. 


Proposition 2.2. Let C be a non-singular projective curve defined over 
an algebraically closed field k. Then for any integer m > 1, the Picard 
group Pic(C) has only finitely many elements of order m. 


PROOF. See (2.7) at the end of this section for a complete description of 
the torsion subgroup of Pic(C). In particular, if C has genus g, then (2.7) 
implies that Pic(C)[m] is isomorphic to (Z/mZ)?9. Oo 


PROOF (of Theorem 2.1). Our first observation is that if L/K is a fi- 
nite Galois extension and if we can prove that E(L)/2E(L) is finite, then 
it will follow that E(k)/2E(K) is also finite. This is the content of 
[AEC, VIII.1.1.1], and we gave another proof using Galois cohomology 
in [AEC, VIII §2]. We leave it to the reader to verify that these proofs 
made no use of the assumption that the field K is a number field, so they 
are also valid in our case. Note that [AEC, II.2.5] ensures that any such L 
will be the function field of a curve over k. 

Replacing K by a finite extension and C with the corresponding curve, 
it thus suffices to prove (2.1) under the assumption that E(k) contains 
all of the points of order 2. Equivalently, we may assume that F has a 
Weierstrass equation of the form 


E:y? = (x —1)(x — e2)(x — es) with e€1,€2,€3 € K. 
Consider the map 


6: E(K)/2E(K) — (K*/K*?) x (K*/K*”) 
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ee if x # €1,€2, 

((e1 — e3)(€1 — €2),€1 —€2)) if =e, 

(e2 — €1, (€2 —e3)(e2-€1)) if =e, 

(1,1) if =0oo (P=O). 


o:P=(x,y)— 


In the case that K is a number field, we proved in [AEC, X.1.4] that ¢ 
is an injective homomorphism, and the same proof works for an arbitrary 
field Kk. 

The map ¢ can also be defined using group cohomology. We briefly 
sketch the proof. Taking Gg/,-cohomology of the exact sequence 


0— ED] — E(k) 2. eu) 0 
gives an exact sequence 
(2 6 
E(K) —> E(K) — H*(Gg/x, E[2)). 
Now our assumption that E[2] C E(K) implies that there are isomorphisms 


A" (Gx x, E[2|) = H' (Gx/x,(Z/2Z)’) = Hom(Gz x, (Z/2Z)”) 
= Hom(Gx/x,Z/2Z)? = Hom(Gxz/«, M2)? ~ H'(Gx/K, M2)”. 


Finally, the Kummer sequence for fields and Hilbert’s theorem 90 give an 
isomorphism [AEC, VIII.2.2] 


K*/K? 5 HG gq, M2): 
Combining the above maps, we obtain an injective homomorphism 


E(K)/2E(K) > H"\(Gx/x, E[2]) © H(Gx/x, M2)” 
oS (K*/K*) x (K*/ KK"). 


Of course, one still needs to unscramble the connecting homomorphisms 
and check that this map is the same as the map ¢ defined above. See 
[AEC, X §]1] for the details. 

We will prove that E(K)/2E(K) is finite by proving that the im- 
age ¢(E(K)/2E(K)) is finite. The basic idea is to show that for any P € 
E(K), the two coordinates of ¢(P) are almost squares in K. The following 
lemma quantifies this assertion. 
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Lemma 2.3.1. Let k be an algebraically closed field, let K = k(C) be 
the function field of a curve, let E/K be an elliptic curve, and suppose 
that E has a Weierstrass equation of the form 

E: y? = (x@ — e1)(x — e2)(xz — e3) with €1,€2,e3 € K. 
Let S C C be the set of points where any one of e€1,€2,€3 has a pole, 
together with the points where A = (e, — €2)*(e, — e3)*(e2 — e3)* vanishes. 
Then for any point P = (x,y) € E(K) with « #e,, 

ord;(a — e,) = 0 (mod 2) for allt € C witht € S. 

Here ord; : k(C)* — Z is the normalized valuation on k(C) which measures 
the order of vanishing of a function at t [AEC II §1]. 


Proor. Let t € C with t ¢ S, and let n = ord;(a — e,). Our choice of S$ 
implies that ord;(e;) > 0. We consider three cases. First, if n = 0, then 
clearly ord;(x — e,) = n = 0 (mod 2). 

Second, ifn < 0, then t must be a pole of x and n = ord;(z). It follows 
that 

n = ord,(a — e;) = ord;(x — e2) = ord; (x — eg). 
Using the Weierstrass equation for EF’, we find that 
2ord:(y) = ord:(y*) = ord: ((x — e1)(x — e2)(x — e3)) = 3n, 

which proves that ord;(z — e1) = n = 0 (mod 2). 

Third, suppose that n > 0. This means that x — e; vanishes at t. We 
claim that x — eg and x — e3 do not vanish at t. To see this, we let 7 = 2 
or 3 and use the triangle inequality to compute 


min{ord;(a—e1), ord;(t—e;)} < ord; ((a—e1)—(#—e;)) = ord; (e;—e1) = 0. 
The last equality follows from the assumption that e1, €2,e3 do not have 


poles at t and A does not vanish at t. But ord;(z — e,) = n > 1, so we 
get ord,(x — €2) = ord;(x — e3) = 0. Therefore 


2 ord;(y) = ord:(y”) = ord: ((a — e1)(a — €2)(a — €3)) = ord; (a — e)). 
Hence ord;(x — e,) = 0(mod 2), which completes the proof of (2.3.1). 


QO 


We now resume the proof of (2.1). Let S be as in the statement 

of (2.3.1), and define a subgroup of K*/K*? by 
K(S,2) ={f € K*/K*? : ord;(f) = 0 (mod 2) for all t ¢ S}. 
(Compare with [AEC, X.1.4].) Then (2.3.1) tells us that the image of ¢ 
lies in K(S,2) x K(S,2), so we have an injective homomorphism 
@: E(K)/2E(K) — K(S,2) x K(S,2). 

In order to complete the proof of (2.1), it suffices to prove that the group 
K(S,2) is finite. Note that the finiteness of K’(S,2) is an assertion about 
the curve C; it has nothing to do with the elliptic curve E. We record this 
statement in the following lemma, whose proof also completes the proof of 
the weak Mordell-Weil theorem (2.1). 
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Lemma 2.3.2. Let k be an algebraically closed field, let K = k(C’) be the 
function field of a curve, let S C C be a finite set of points, and let m > 1 
be an integer. Then the group 


K(S,m) ={f € K*/K*™ : ord;(f) =0(mod m) for allt ¢ S} 
is a finite subgroup of K*/K*™. 
Proor. Let s = #5. Then there is an exact sequence 


0 — K(@,m) — K(S,m) —  (Z/mZ)°. 
ae (ord:(f)) res 


It thus suffices to prove that 
K(0,m) = {f € K*/K*™ : ord;(f) = 0 (mod m) for all t € ch 


is finite. 
Let f (mod K*™) € K(@,m). Then div(f) has the form 


div(f) =mDy for some Dy € Div(C). 


Notice that if we take some other representative fg™ for the coset of f 
in K*/K*™, then Dygm = Dy + div(g) changes by a principal divisor. 
Thus the divisor class of Dy in Pic(C) is independent of the choice of 
representative. Further, mDy = div(f) is itself principal, so we get a well- 
defined homomorphism 


K(0,m) — Pic(C)[m], f (mod K*™) + class(Dy), 


where Pic(C)|m] denotes the elements of Pic(C’) of order dividing m. 

Now suppose that f (mod K*™) is in the kernel of this homomorphism. 
Then Dy is principal, say Ds = div(F) for some function Fy € K*, which 
means that 


div(fF;™) = div(f) — mdiv(F) = div(f) — mDy = 0. 


Thus fF; has no zeros or poles, so [AEC, II.3.1] tells us that it is con- 
stant. Using the assumption that k is algebraically closed, we can write this 
constant asc”, so f = (cF;)™. In other words, f = 0(mod K*”™). This 
proves that the homomorphism K(0,m) — Pic(C)[m] is injective. Finally, 
we use the fact (2.2) that Pic(C)|m] is finite to conclude that K(@,m) is 
finite. oO 


In the remainder of this section we briefly discuss abelian varieties and 
Jacobians, including a sketch of the proof that the Picard group of a curve 
has only finitely many elements of a given order. 
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Definition. An abelian variety consists of a non-singular projective vari- 
ety A, a point O € A, and two morphisms 


p:AxA—->A, i: A—A, 


which make the points of A into an abelian group. In other words, 
(i) w(O, P) = u(P,O) = P for all P € A, 
(ii) w(P,i(P)) = O for all PE A, 
(iii) w(u(P, Q), R) = u(P, u(Q, R)) for all P,Q, RE A, 
(iv) u(P,Q) = u(Q, P) for all P,Q € A. 
If A, yw, and i are defined over a field k, and O € A(k), then we say that A 
is defined over k. Basic references for the theory of abelian varieties are 


Griffiths-Harris [1, Ch. 2, §6], Milne [2], and Mumford [1], but we will need 
very little of the general theory. 


Example 2.4.1. An elliptic curve is an abelian variety of dimension one. 
The fact that the addition and negation operations on an elliptic curve 
satisfy (i)-(iv) is [AEC, III.2.2], and the fact that they are morphisms 
is [AEC, III.3.6]. Conversely, every abelian variety of dimension one is an 
elliptic curve, see exercise 3.5. 


Example 2.4.2. Let A be an abelian variety of dimension d defined over C. 
Then one can show that there is a lattice A C C4? and a complex analytic iso- 
morphism A(C) & C4/A. By lattice we mean a full sublattice, that is, a free 
subgroup of C? of rank 2d which contains an R-basis for C¢. The isomor- 
phism A(C) & C4/A is an isomorphism both as complex manifolds and as 
abelian groups. For elliptic curves, the existence of this isomorphism is the 
essential content of the uniformization theorem [AEC, VI.5.1.1]. Further, 
in the one dimensional case every lattice A Cc C corresponds to an elliptic 
curve [AEC, VI.3.6]. But in higher dimensions, a complex torus C?/A will 
only be isomorphic to an abelian variety if the lattice A satisfies the Rie- 
mann conditions (see Griffiths-Harris [1, Ch. 2, §6]). In other words, there 
are certain restrictions on the lattice A in order for there to be a complex- 
analytic embedding of C4/A into projective space P”(C). Just as in the 
case of elliptic curves, this complex uniformization of abelian varieties is 
very useful in analyzing the group structure of A(C). 


Remark 2.5. A more succinct way to define abelian varieties is to say that 
an abelian variety is a group variety in the category of projective varieties. 
It turns out that the group law forces A to be non-singular. Further, the 
completeness of the variety A forces the group law to be commutative, so 
we actually did not need to include property (iv) in our definition. See 
Mumford [1, pp. 1, 41, 44] for three proofs of this fact. 


The next proposition tells us that the Picard group of a curve is es- 
sentially an abelian variety, called the Jacobian variety of the curve. Its 
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dimension is equal to the genus of the curve. Some general references for 
Jacobian varieties include Griffiths-Harris [1, Ch. 2, §§2, 3, 7], Milne [3], 
and Mumford [2]. 


Proposition 2.6. Let C be a non-singular projective curve of genus g 
defined over an algebraically closed field k. 
(a) The degree map deg : Div(C) — Z induces an exact sequence 


0 — Pic°(C) — Pic(C) “8% Z — 0, 


where Pic®(C) is the group of degree 0 divisor classes on C’. 
(b) There exists an abelian variety Jac(C) of dimension g and a natural 
isomorphism of groups 


Pic®(C) > Jac(C). 


(For the meaning of “natural”, see (2.6.1) below.) Jac(C) is called the 
Jacobian variety of C. 


Proor. (a) The exact sequence merely defines the subgroup Pic®(C), so 
all we need to do is verify that the degree map is well-defined on Pic(C). 
This follows immediately from the fact that every principal divisor has 
degree 0 [AEC, 1.3.1]. 

(b) (Proof Sketch) We start with the two easy cases. First, if g = 0, then 
every divisor of degree 0 on C & P! is principal [AEC, I1.3.2]. It follows 
that Pic?(C) = 0, which is the desired result in this case. 

Next suppose that g = 1. Fixing a point O € C(k), we turn C/k into 
an elliptic curve, so C' is an abelian variety (2.4.1). Then there is a natural 
group isomorphism Pic?(C’) “> C as described in [AEC, III.3.4]. Hence C 
is its own Jacobian variety. 

For curves of higher genus, one can construct the Jacobian variety 
analytically if k = C or algebraically in general. We briefly describe both 
approaches. For the algebraic method, we fix a basepoint Po € C and 
consider the map 


bg: C9 — Pic®(C), (Pi,.--, Py) > (Pi) ++++ + (Py) — g(Po). 


Let the symmetric group S, act on C9 by permuting the coordinates. The 
map @, is clearly invariant under this action, so it induces a map on the 
quotient Cl def os /Sg- One checks that C is a non-singular variety, 
that ¢y : C' — Pic®(C) is surjective, and that og is injective off of a 
Zariski closed subset of C‘%). Further, the group law on Pic®(C) induces 
an algebraic (i.e., rational) map C'9 x C(9) — C%. Unfortunately, this 
rational map is not defined everywhere, so one takes certain “group chunks” 
and glues together enough translates to form a group variety. This idea 
of gluing together group chunks is due to André Weil [3] and works in all 


198 III. Elliptic Surfaces 


characteristics. We refer the reader to the discussion in (IV, §6) for further 
details. See also exercise 4.29. 

The analytic approach to constructing the Jacobian for curves over C 
is much older. The Riemann-Roch theorem [AEC, I.5.3, I1.5.5a] says that 
on a curve of genus g, the space of holomorphic differential forms has di- 


mension g. Let w1,...,wg be a basis for this space. Next, a curve C of 
genus g over C is a Riemann surface with g holes, so there are 2g inde- 
pendent cycles on C. Let T),...,f%2, be a basis for the space of cycles; 


that is, I,,...,[P29 is a basis for the first homology H,(C,Z). We fix a 
basepoint Py € C' and consider the map 


P P 
e:C—-C%, Pr i res f Wg |. 
Po Po 


Here the integrals are to be computed along some path from Pp to P. 
Unfortunately, the value of the integrals is not path-independent! (See 
[AEC, VI, §1] for a discussion when g = 1.) 

In order to salvage this idea, we consider the subgroup A C C9 defined 
to be the image of the map 


A, (C,Z) — C!, Po (fn... fue) 
Tt Te 


Then the integrals give a well-defined map ¢ : C — C9/A, since A elim- 
inates the path dependence of the integrals. Next one proves that A is 
a lattice which satisfies Riemann’s conditions (2.4.2), so C9/A is complex- 
analytically isomorphic to an abelian variety. Denoting this abelian variety 
by Jac(C), one verifies that the map ¢: C — Jac(C) is a morphism of al- 
gebraic varieties. 

Extending @ by linearity gives a map 


@ : Div(C) — Jac(C), So ni(P;) — So [ni]d(P,). 


In other words, use ¢ to map the points in a divisor to points of Jac(C), and 
then use the group law on Jac(C’) to add them up. Finally, the theorems 
of Abel and Jacobi say that the map ¢ : Div°(C) > Jac(C) is a surjective 
homomorphism whose kernel consists of precisely the principal divisors. 
Hence @ induces the desired isomorphism Pic?(C) > Jac(C). For further 
details, see Griffiths-Harris [1, Ch. 2, §§2,7]. o 


Remark 2.6.1. What do we mean in (2.6b) when we say that the isomor- 
phism Pic®(C) — Jac(C) is “natural”? Recall (AEC, 11.3.7] that a mor- 
phism ¢ : Cy — Cy of curves induces a homomorphism ¢* : Pic?(C2) > 
Pic®(C,) of their Picard groups. Then one can prove that the corresponding 
map ¢* : Jac(C2) — Jac(C)) is a morphism of varieties. In fancy language, 
the association C +> Jac(C) is a functor from the category of (non-singular 
projective) curves to the category of abelian varieties. 
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Remark 2.6.2. If C is defined over an arbitrary field k, then its Jacobian 
variety Jac(C’) will be an abelian variety defined over k. Further, the group 
isomorphism Pic?(C) — Jac(C) will commute with the action of the Galois 
group G,,,. This is another way in which the Jacobian is a natural object. 
For example, if C is a curve of genus 1 with C(k) = @, then C cannot be its 
own Jacobian variety. However, we can always find an elliptic curve E/k 
so that C/k is a homogeneous space for E [AEC, exercise 10.3]. Then 
[AEC, X.3.8] says that there is a group isomorphism Pic®(C) > E, so E is 
the Jacobian of C. 


Remark 2.6.3. For hyperelliptic curves, it is possible to construct the 
Jacobian variety quite explicitly, see Mumford [3, Ch. Ia, §§2,3]. In this 
case one can also precisely describe all of the elements of order 2 in Pic(C), 
see exercise 3.38. 


The following corollary of (2.6) and (2.4.2) is a strengthened version 
of (2.2). 


Corollary 2.7. Let C/k be a non-singular projective curve of genus g 
defined over an algebraically closed field k. Then 


Pic(C)tors & (Q/Z)?9. 


In particular, for any integer m > 1, Pic(C)[m] & (Z/mZ)?9, so Pic(C) has 
only finitely many elements of order dividing m. 


ProoF. The field k has characteristic 0 by assumption, so the Lefschetz 
principle [AEC, VI 86] says that we may take k to be a subfield of C. 
Let J = Jac(C) be the Jacobian variety of C. Then (2.6) tells us that 
Pic(C)tors & Pic®(C)tors & Jtors. On the other hand, (2.4.2) implies 
that there is a lattice A C C9 and a complex-analytic group isomor- 
phism J(C) = C9/A. (See also the proof of (2.6) for a concrete description 
of the isomorphism Pic?(C) % C9/A.) 

As abstract groups, CY & R79 and A & Z9, and if we use a Z-basis 
for A as our R-basis for C2, then we obtain a group isomorphism of the 
quotients C9/A & (IR/Z)?9. Hence 


Pic(C) tors a Jtors = J( Cy ices = (C9/A)tors = (R/Z) 29 = (Q/Z)?9. 


tors 


This proves the first assertion of (2.7), and the other assertions are an 
immediate consequence. O 
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83. Elliptic Surfaces 


We now return to the idea that an elliptic surface should be a one-parameter 
family of elliptic curves. For example, we might consider a family 


Er:y?=2°4+ A(T)x + B(T) 


with rational functions A(T), B(T) € k(T). Or, more generally, we could 
fix a non-singular projective curve C/k and take 


E:y=224+Ar+B 


for some A, B € k(C) with 4A°+27B? £0. Then for almost all points t € 
C(k) we can evaluate A and B at t to get an elliptic curve 


E,:y? = 23 + A(t) + Bit). 


Suppose now that we do not evaluate A and B at particular points 
of C’, but instead we treat ¢ as a variable just like x and y. In other words, 
we look at the subset of P? x C defined by 


€ = {([X,¥,Z],t) €P? x C : Y9Z = X9 + A(t)XZ? + BY)Z}. 


Note that € is a subvariety of P? x C of dimension two; it is a surface 
formed from a family of elliptic curves. 


Remark 3.1. Actually, in defining € we need to take a little more care 
with the points t € C where A or B has a pole. Here’s one way to handle 
this problem. Consider the set 
A or B has a pole at ¢, or 
2 ‘ ? 
{ (x. ¥iZ),t) EP! x€ : ya7 _ x34 Aayxz? + Pees 
This set will consist of a number of irreducible components, all but one of 
which will look like 
Pp? x {to} 


for a pole to of A or B. We take € to be the one component not of this 
form. Equivalently, € is the Zariski closure in P? x C of the set 


{(¥%Zh8) cPxc: t is not a pole of A or B, and \ 


Y¥2Z = X34 A(t)XZ? + B(t)Z 
Since € is a subvariety of P? x C, projection onto the second factor 


defines a morphism 


Tw: € — OC, 
((X,¥,Z],t) ro t. 
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And for almost every point t € C, the fiber 
&=a 1) ={P ets a(P)=t} 


is the curve £; that we considered earlier. Further, since we have assumed 
that 
A = -16(4A% + 27B?) 40 in k(C), 


it will be true that almost every fiber €; is an elliptic curve. We just need 
to choose points t € C such that A(t) 4 co, B(t) # oo, and A(t) 40. 

However, our family of elliptic curves € has one other important prop- 
erty. Recall [AEC III §3] that an elliptic curve is really a pair (E,O), 
where F is a curve of genus 1, and O is a point of EF. The equation we 
used to define € gives a one-parameter family of elliptic curves. This means 
that for almost all values of t we get an elliptic curve €;, which we should 
really write as (€;,O;) to emphasize that each €; comes equipped with a 
zero element O; € E;. 

The family € is an algebraic family, which is a fancy way of saying 
that it is given by an equation whose coefficients A and B are algebraic 
functions, in our case functions on the curve C’. The additional property 
that € possesses is that the collection of zero elements O; is an algebraic 
family of points. In other words, we claim that the coordinates of O; are 
algebraic functions of t; the coordinates of O; are in the function field of C. 
Using the definition of E given above, we see that 


O; = ((0,1,0],t) € & CP? xc. 
So the coordinates [0,1,0] of O; are actually constant functions. 
We can describe this property in another way. Since each fiber €; is 


an elliptic curve with zero element O;, we get a map 


oo: C — 6, 
t => O?;. 


Clearly, this map has the property that 

m(oo(t)) =t for all t € C(k). 
Further, the fact that O, is an algebraic family of points is equivalent to 
the fact that the map go is a rational map of varieties. (In fact, since C is 


a non-singular curve, 09 will be a morphism [AEC II.2.1].) This prompts 
the following definition. 


+ The phrases “for almost every” and “for almost all” are contractions of the 
expression “for all but finitely many.” 
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Definition. Let 7 : V — W be a morphism of algebraic varieties. A 
section to m is a morphism 


a:W—-V 
such that the composition 
nroa:W—->W 
is the identity map on W. 
Example 3.2. Consider the surface given by the equation 
Or Zeer a= T XA EV 
(See (1.1) for our original discussion of this equation.) More precisely, let € 


be the projective surface in P? x P! corresponding to this equation. The 
projection 7 is the map 


n:€—P', ((X,Y, Z],T) — T. 
This map has a section 
o:P!—s€, Te ({7,T,1],T). 
To avoid excessive notation, one often says that 7 : € — P! has the sec- 
tion o = [T,T, 1]; one can even use the inhomogeneous equation 
y= — Te +7? 
for € and say that o = (T,T) is a section to 7. 

We are now ready for the formal definition of an elliptic surface. 
Definition. Let C be a non-singular projective curve. An elliptic surface 
over C’ consists of the following data: 

(i) a surface €, by which we mean a two dimensional projective vari- 


ety, 
(ii) a morphism 


T:E—-C 
such that for all but finitely many points t € C(k), the fiber 
& = n*(t) 


is a non-singular curve of genus 1, 
(iii) a section to 7, 
09: CE. 
Let € — C be an elliptic surface. The group of sections of E over C is 
denoted by 
E(C) = {sections 0: C — €}. 

Note that any rational map C — € is automatically a morphism, since C 
is a non-singular curve and € is a projective variety [AEC, II.2.1], so every 
section is a morphism. We will see later (3.10) that E(C) is a group with 
zero element do. 
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Remark 3.3.1. Since all but finitely many fibers of an elliptic surface 
have genus 1, one can show that any fiber which is a non-singular curve 
will automatically have genus 1. See Hartshorne [1, IIJ.9.13]. These non- 
singular fibers are often called the good fibers. The fibers €; which are 
not non-singular curves will be called the singular fibers or the bad fibers. 
Of course, when we refer to the non-singular fiber €;, we really mean the 
elliptic curve consisting of the pair (E1,00(t)). 


Remark 3.3.2. Our definition of elliptic surface is non-standard in two 
ways. First, most books require that € be a non-singular surface. In such 
a case we will call € a non-singular elliptic surface. Second, most algebraic 
geometers would define an elliptic surface to be a (non-singular) surface 
satisfying properties (i) and (ii) of our definition; they would not require 
that there be a section. This leads to many interesting geometric questions, 
such as the possible existence of multiple fibers. (See Griffiths-Harris [1, 
p. 564].) It is only our emphasis on questions with an arithmetic flavor 
which prompts us to require the existence of at least one section. 


Remark 3.3.3. The classical theory of elliptic surfaces deals with surfaces 
defined over the field k = C, or more generally over an algebraically closed 
field. We will also want to look at other fields, such as k = Q. We will say 
that an elliptic surface € over C is defined over k if the curve C is defined 
over k, the surface € is defined over k, and both of the maps 


nm: BC and 09: C—-E 
are defined over k. In this case we write 
E(C/k) = {sections 0 : C > E such that o is defined over k} 


for the group of sections defined over k. For example, the elliptic sur- 
face (3.2) is defined over Q, and the section o = (T,T) is in €(C/Q). 


Let € be an elliptic surface over C’ defined over k. We would like 
to associate to € an elliptic curve E/k(C). Conversely, to each elliptic 
curve £'/k(C) we will assign a birational equivalence class of elliptic sur- 
faces. In this way we will be able to apply our earlier results to study 
elliptic surfaces. 

We begin by recalling some general definitions and basic facts about 
rational maps. For more details, see Hartshorne [1, I §4], Harris [1, Lec- 
ture 7], or Griffiths-Harris [1, 4 §2]. 


Definition. Let V and W be projective varieties. A rational map from V 
to W is an equivalence class of pairs (U, gy), where U is a non-empty Zariski 
open subset of V and @y : U — W is a morphism. Two pairs (U, dy) 
and (U’, ¢u-) are deemed equivalent if dy = @y, on UNU". If ¢ is repre- 
sented by a pair (U, dy), we say that @ is defined on U. 
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The image of a rational map ¢, denoted ¢(V), is defined as follows: 
Let U Cc V be an open subset on which ¢ is defined, take the Zariski closure 
in V x W of the graph 


{ (u, o(u)) EVxW: ueU}, 


and take the projection to W. (N.B. Since ¢ is not actually defined at 
all points of V, this “image” of ¢ may not agree with your intuition. For 
example, there may be points w € ¢(V) which are not equal to é(v) for 
any v € V.) 

A rational map ¢: V — W is dominant if o(V) = W. Equivalently, ¢ 
is dominant if for one (hence every) open set U on which it is defined, the 
image ¢(U) is Zariski dense in W. 

The domain of definition of a rational map ¢: V — W, which we de- 
note by Dom(4), is the largest open subset of V on which ¢ is a morphism. 
(Such a largest set exists, see Hartshorne [1, I exercise 4.2].) 

A rational map ¢: V — W is a btirational isomorphism if it has a 
rational inverse yy: W — V; that is, @ and yw are dominant and 


oow:W WwW and podg:V——AV 


are the identity maps at all points for which they are defined. If there 
is a birational isomorphism from V to W, then V and W are said to be 
birationally equivalent. If V, W, @, and w are all defined over a field k, 
then we say that V and W are birationally equivalent over k. 


Remark 3.4. In [AEC, I §3] we defined rational maps more naively using 
coordinates on P”. The reader will easily check that the two definitions 
are equivalent. For examples of rational maps that are not morphisms, 
see [AEC, 1.3.6 and 1.3.7]. Notice that [AEC, 1.3.7] gives an example of non- 
isomorphic varieties that are birationally equivalent. Another important 
example of this phenomenon is provided by the process of “blowing-up”; 
see Hartshorne [1, I §4] or Harris [1, Lecture 7]. 


Proposition 3.5. Let ¢: V — W be a rational map of projective 
varieties. 

(a) The image 6(V) is an algebraic subset of W. If V is irreducible, then 
so is OV). 

(b) Suppose that V is non-singular. Then @ is defined except on a set of 
codimension at least two. In other words, every component of the comple- 
ment of Dom(¢) in V has codimension at least two. 


PROOF. (a) See Harris [1, Lecture 7, p. 75] or Griffiths-Harris [1, 4 82, 
p. 493] for the first part. The second part is immediate from the definition 
of irreducibility; see exercise 3.7. 

(b) See Griffiths-Harris [1, 4 §2, p. 491]. oO 
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Proposition 3.6. Let V/k and W/k be projective varieties. The follow- 
ing are equivalent. 

(i) V and W are birationally equivalent over k. 

(ii) The function fields k(V) and k(W) are isomorphic as k-algebras. 

(iii) There are non-empty Zariski open sets U; C V and Uz C W defined 
over k such that U, and U2 are isomorphic over k. 


PROOF. This is a standard (and elementary) result in algebraic geometry. 
See, for example, Hartshorne [1, 1.4.5] or Harris [1, exercise 7.10]. Oo 


Let 6 : V — W be a dominant rational map, and let f € k(W) 
be a rational function on W. Then f is defined (i.e., regular) on a non- 
empty open subset of W, and ¢(Dom(¢)) is Zariski dense in W, so the 
composition fo¢@ is a regular function on a non-empty open subset of V. In 
other words, fo@¢ is a rational function on V, so we obtain a homomorphism 
of k-algebras 

k(W) — k(V), 
fo or fo@. 


The next result says that the theory of varieties up to birational equiv- 
alence is essentially the same as the theory of their function fields. (See 
Hartshorne [1, 1.4.4] for a more precise categorical statement.) 


Proposition 3.7. Let V/k and W/k be projective varieties. The associ- 
ation 


dominant rational pea a5 apa a aaa 


V — W defined over k k(W) > k(V) 
d — (f> fog) 
is a bijection. 
Proor. See Hartshorne [1, 1.4.4]. Oo 


We are now ready to apply the theory of rational maps to study elliptic 
surfaces. Recall that according to our definition, an elliptic surface consists 
of three pieces of data: 

(i) a projective surface €, 

(ii) a projection map 7:E —C, 

(iii) a zero section 09: C — E. 

Given two elliptic surfaces E and €’ over the same base curve C, it thus 
makes sense to consider the rational maps € — €’ which commute with 
projections and/or zero sections. This prompts us to make the following 
definitions. 
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Definition. Let 7: € — C and 7’: €' — C be elliptic surfaces over C. 
A rational map from E to €' over C is a rational map ¢@: € — €’ which 
commutes with the projection maps, 7’0 ¢ = 7. The elliptic surfaces € 
and €’ are birationally equivalent over C if there is a birational isomor- 
phism ¢: € > €’ which commutes with the projection maps. If the elliptic 
surfaces and rational maps are defined over a field k, we will say that € 
and €' are k-birationally equivalent over C. 


The next two propositions explain precisely how the theory of elliptic 
curves over k(C’) is the same as the birational theory of elliptic surfaces 
over C’. 


Proposition 3.8. (a) Fix an elliptic curve E/k(C). To each Weierstrass 
equation for EF, 


E:y=2°+Azr+B, A,B €k(C), 
we associate an elliptic surface 
£(A, B) = {([X,Y,Z],t) €P? x C : ¥Y°Z =X? 4+ A()XZ? + BYL)Z} 


as described in (3.1). Then all of the €(A, B) associated to E are k-bira- 
tionally equivalent over C. 
(b) Let € be an elliptic surface over C' defined over k. Then € is k-bira- 
tionally equivalent over C to E(A,B) for some A,B € k(C). Further, the 
elliptic curve 

E:y=2°+Ac+B 
over k(C) is uniquely determined (up to k(C)-isomorphism) by €. 
(c) Let E/k(C) be an elliptic curve and € — C an elliptic surface associ- 
ated to E as in (a). Then 


k(E) = k(C)(E) as k(C)-algebras. 
Here the projection map 7 : € — C induces an inclusion of fields k(C') - 
k(€) which makes k(€) into a k(C)-algebra. 
We say that E/k(C) is the generic fiber of E — C. 
PRoorF. (a) Suppose we take another Weierstrass equation for E/k(C), say 
B:y=2° +A +B’, A,B EK(C). 


Then there is a u € k(C)* such that utA’ = A and u®B’ = B [AEC IIL.1.3}. 
Now the map 


€(A’, B’) — &(A, B), ([X’,Y’, 2], t) > ([w?X’, wy’, 2", t) 


shows that €(A, B) and €(A’, B’) are k-birationally equivalent over C. 
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(b,c) The projection map a: € — C induces an inclusion of function 
fields k(C) — k(€) in the usual way, f + fo. Further, € is a surface 
over k, and C is a curve over k, so k(€)/k has transcendence degree 2 
and k(C)/k has transcendence degree 1. It follows that k(€)/k(C) has 
transcendence degree 1, so there exists a curve E/k(C), unique up to k(C)- 
isomorphism, whose function field k(C)(E) is isomorphic to k(€) as k(C)- 
algebras [AEC II.2.5]. 

We claim that E is a curve of genus 1. To see this, we write E as a 
subvariety of P", so E is the set of zeros of a collection of homogeneous 
polynomials {f;(x) : 1<%i<r} with coefficients in k(C). Note that for 
almost all t € C, we can evaluate the coefficients of the f;’s to get polyno- 
mials with coefficients in k. To indicate the dependence of the f;’s on t we 
will write f;(t,x). Then we can consider the algebraic variety in P” x C 
defined by 


Vv & {(x,t)eP"xC: filt,x) =0 for 1 <i<r}. 


Projection onto the second factor gives a map V — C' which makes k(V) 
into a k(C)-algebra, and by construction we see that k(V) is isomorphic 
to k(C)(E) as k(C)-algebras. Hence k(V) is isomorphic to k(€) as k(C)- 
algebras, so (3.6) tells us that V and € are birationally equivalent over C. 
In particular, for almost all t € C the fibers V; and €; are isomorphic, so 
almost all of the V;’s are curves of genus 1. 

Now suppose that w € Qgxc) is a differential form on E. (See 
[AEC, II §4] for general properties of differential forms on curves.) Any such 
differential can be written as a sum w = ou, dv; with u;,vj € k(C)(£). 
For almost all t € C we can evaluate the u,’s and v,’s at t to get a differ- 
ential form uw, = w(t,x) on V;. Further, if w is a holomorphic differential 
form on £, then uw, will be a holomorphic differential form on V; for almost 
allt eC. 

Let w1,wW2 € Qe/xe(c) be non-zero holomorphic differential forms. We 
claim that they are k(C)-linearly dependent. To prove this, we observe 
that for almost all t € C, the forms wi(t,x) and wo(t,x) are holomor- 
phic differentials on the curve V; of genus 1, so they are k-linearly de- 
pendent from [AEC, I1.5.3,II.5.5a]. In other words, there are non-zero 
constants a,b; € k such that 


a,w (t,x) + bjwo(t,x) = 0. 


But this means that the function w1(t,x)/wo(t,x) € k(V;) is constant; 
that is, it is in k. It follows that the function w;/w2 € k(V) = k(C)(E) is 
actually in k(C), which proves that w; and we are k(C)-linearly dependent. 

To recapitulate, we have proven that the vector space of holomorphic 
differential forms in Q¢ xc) has k(C)-dimension at most one. It follows 
from [AEC, II.5.3,11.5.5a] that EF has genus at most 1. Suppose that E/k(C) 
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has genus 0. We will see below that E (k(C)) is non-empty, so we can 
assume that E = P! and V = P! x C. But then the fibers V; & P! are 
all curves of genus 0, contradicting the fact that V; & €; has genus 1 for 
almost all t. This contradiction shows that E'/k(C) does not have genus 0, 
so the only possibility is that E/k(C) has genus 1. 

Further, the section og : C — E corresponds to a point Po € E(k(C)). 
To see this we use the fact that V and € are birationally equivalent over C 
to get a section og : C — V. This section is a map of the form of = 
[ho,..-, An] for certain functions ho,...,hn € k(C), which is the same as 
saying that Po = [Ao,..., hn] € P"(k(C)) is a point in E(k(C)). Taking Pp 
to be the identity element, E becomes an elliptic curve defined over k(C). 

We have now proven that £’/k(C) is an elliptic curve, so we can take 
a Weierstrass equation for it, say 


E:y=22+Ac+B with A,B eEk(C). 


Then the corresponding surface V is precisely the elliptic surface €(A, B) 
described in (a), and we have already observed above that V is birationally 
equivalent to € over C. This completes the proof of the first part of (b) 
which asserts that every elliptic surface is birationally equivalent over C 
to some €(A,B). Further, we showed above that there are isomorphisms 
of k(C)-algebras 

kK(E) = k(V) = k(C)(E), 


which completes the proof of (c). 

It remains to prove that E is determined up to k(C)-isomorphism by €. 
Suppose that € is also birationally equivalent to €(A’, B’). Then €(A, B) 
and €(A’, B’) are birationally equivalent over C. This means that for al- 
most all t € C, there is an isomorphism on the fibers 


{y? =2° + A(t)e + B(t)} > fy? =27 + A'(t)a + Bi(t)}. 
From general principles [AEC, III.3.1b], we know that this isomorphism is 
given by a map of the form (x, y) + (asx, ,y) for some a,, 6, € k. But the 
birational equivalence €(A, B) — €(A’, B’) is an algebraic map, so we see 
that a and @ are functions on €(A, B) which depend only on t. In other 
words, a, 3 € k(C), which proves that the corresponding elliptic curves 
y=ae'+Ar+B and y? = 23 + Ale + B' 
are isomorphic over k(C). Oo 


Proposition 3.9. There is a natural bijection 


{ dominant rational ie _—_ ee maps E > ay 
E — €' over C defined over k(C) , 
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where we identify the set of elliptic curves defined over k(C) with the set 
of (birational equivalence classes) of elliptic surfaces over C' as described in 
Proposition 3.8. 


PRoor. Let ¢ : € — €&’ be a dominant rational map over C. Choose 

Weierstrass equations for € and €’, or equivalently for E and E’, 
E:y2=234+ Ar+B, El sy? =a + Alc! +B’ 

Then the map ¢ will have the form 


@: ((x,y),t) > (F(x), 9(t,2,y)),t), 
where f,g € k(E) = k(C)(E) = k(C)(x,y). Hence F = (f,g) defines a 
map F: E > E’. 

Suppose that F is a constant map from E to E’ over k(C). This 
means that f,g € k(C), so ¢((z,y),t) = ((f(t), 9(t)), t) depends only on f, 
independent of x and y. It follows that the image of @ has dimension at 
most one, since it is the image of a map C — €’, so in particular ¢ is not 
dominant. This proves that if 6: € — €' is dominant, then the associated 
map F’: E — E’ is non-constant. 

The proof going the other direction is similar. Fix Weierstrass equa- 
tions for F and E’ as above, and let F : E — E’ bea map defined over k(C). 
Then F' has the form F = (f,g) for some f,g € k(C)(E) & k(€), and we 
can define a map @: € — €’ over C by 


co) : (2, y); t) ars ((f (t, z, y), g(t, z, y)) ’ t). 

If ¢ is not dominant, then its image must consist of a curve, since 7 
maps the image ¢(€) onto C. But this means that if we fix (almost any 
point) ¢ € C and vary x,y on the fiber €;, then o((a, y), t) can assume only 
finitely many values. In other words, the map ¢(-, t) : €; — P! takes on 
only finitely many values, so it is constant [AEC II.2.3]. Hence f and g do 
not depend on z,y, so f,g € k(C) and F = (f,g): E — E’ is a constant 
map. This proves that if F : E — E’ is non-constant, then the associated 
map @: € — €’ is dominant, which completes the proof of Proposition 3.9. 

oO 


Our final task is to explain how the set of sections €(C’) has a natural 
group structure. Recall that for almost all points t € C, the fiber €; is an 
elliptic curve (3.3.1), so given any two points on €;, we can add them or 
take their inverses. Let 01,02 € E(C) be two sections to €. We define new 
sections 0; + o2 and —o;, by the rules 

(a, + 02)(t) = o1(t) + o2(t) and (—01)(t) = —(o1(8)), 
valid for all ¢ € C such that the fiber €; is non-singular. We will verify 
below (3.10) that 0, + o2 and —o, define rational maps C — €, so in fact 
they define morphisms since C is a non-singular curve [AEC, II.2.2.1]. The 
next proposition says that this “fiber-by-fiber” addition makes E(C’) into a 
group. 
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Proposition 3.10. Let € — C be an elliptic surface defined over k. 

(a) Let 01,02 € E(C/k) be sections defined over k. Then the maps 0, +02 
and —oy described above are in €(C’/k). 

(b) The operations (01,02) — 01 +02 and 0 — —o make €(C/k) into an 
abelian group. 

(c) Let E/k(C) be the elliptic curve associated to € as described in (3.8). 
Then there is a natural group isomorphism 


B(K(C)) oe 
P= (xp,yp) > (op:t—> ((xp(t), yp(t)), t)). 


PROOF. (a) Take a Weierstrass equation for € — C’ as described in (3.8), 
E:y=2°+Arc+B, A,B €k(C). 
Then a section a; : C — € is given by a pair of functions 
oi:tro (xi(t), yi(t)) 
which satisfy the given Weierstrass equation for (almost all) t € C. Equiva- 
lently xj, y; € k(C) are functions satisfying y? = «3 + Av; + B as elements 


of k(C). By definition, (0, + o2)(t) is the sum of the two points o;(t) 
and o9(t) using the addition law on the elliptic curve 


Ep:y?=2°4+ A(t)z + Bit). 
So the usual addition formula [AEC III.2.3] says that if 21(t) 4 xo(t), then 


(01 + 02)(t) = o1(t) + o2(t) = (x1 (t), n(t)) + (w2(t), yo(t)) 
yo(t) ~ y(t) \ A 
~ (Gasa mes ) 


(We leave it to you to fill in the y-coordinate.) In other words, if 21 4 x2 
in k(C), then the map oj + 02 is given by the formula 


2 
0, +02= (2 un) vy rae), 
2 ey: 


which shows that 0, +092 is arational map from C to € defined over k. Simi- 
larly, ify = x2 and y; # yo, the duplication formula [AEC III.2.3(d)] yields 
the same conclusion. Finally, the map —a, is given by —o; = (%1,—y1), 
so —0, also gives a rational map C — E defined over k. But C' is a non- 
singular curve, so all of these rational maps are morphisms [AEC II.2.2.1], 
which completes the proof that 0, + 72 and —o, are in €(C/k). 
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(b) This is clear from the fact that the points on almost every fiber form a 
group. For example, for any three sections 0),02,03 € E(C/k) and almost 
all points t € C, we have 


((o1 + 02) + 03)(t) = (01 + 02)(t) + o3(t) = (1 (t) + o2(€)) + a(t) 
= 01(t) + (a(t) + o3(t)) = o1(t) + (02 + 93) (t) = (a1 + (02 + 93)) (Et). 


Hence ((01 + 02) + 03) = (01 + (02 + 03)) as sections, which verifies 
the associative law. The other group axioms can be verified in a similar 
fashion. 

(c) Fix a Weierstrass equation for € — C as in (a). If P = (ap,yp) € 
E(k(C)), then xp and yp satisfy the given Weierstrass equation as ele- 
ments of the function field k(C), so (xp(t),yp(t)) € € for almost all t € 
C. This shows that op is a well-defined element of €(C/k). Similarly, 
any a € €(C/k) has the form o(t) = ((zo(t),yo(t)),t) for some rational 
functions £5, Yo € k(C) satisfying 


yo(t)? = xo(t)® + A(t)x,(t) + B(t) for almost all t € C. 


It follows that P, = (2o,Yo) satisfies the given Weierstrass equation, 
so P, € E(k(C)). The identifications P + op and a } P, are clearly 
inverse to one another, so they define bijections E(k(C))  €(C/k). Fi- 
nally, for any P,, Pp € E(k(C)) we have 


(op, a3 oP, )(t) = (xp, (t), yp, (t)) + (xp, (t), yp, (t)) i OP, +P, (t). 


Similarly, -op = o_p, which shows that the map E(k(C)) — E(C/k) isa 
homomorphism, hence an isomorphism. oO 


Remark 3.11. It is important to observe that we can only add points 
on € if they lie on the same (non-singular) fiber. This enables us to add 
two sections together, but it does not make the surface € itself into a 
group. Another way to say this is to use the notion of fiber product (Hart- 
shorne [1, II §3]). The fiber product € xc € of € with itself relative to the 
map 7: € — C is the set of pairs (z1, 22) in the ordinary product € x E 
with the property that a(z1) = m(z2); that is, € xc € consists of all pairs 
of points on € which lie on the same fiber over C.. It is a variety, and the 
“group operation” is then the rational map 


ExcEE 


defined by addition on each (non-singular) fiber. One might say that € 
is a group relative to the projection map 7: € — C. We will see this 
construction appearing in a much more general setting when we discuss 
group schemes in the next chapter. 
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§4. Heights on Elliptic Curves over Function Fields 


Let E/K be an elliptic curve defined over a function field. We have 
proven (2.1) that the quotient E(k)/2E(K) is finite. We would like to 
use the Descent Lemma [AEC VIII.3.1] to prove that E(K) is a finitely 
generated group. This means we need a height function on E(k) that sat- 
isfies certain properties. We begin by defining a height function on K and 
then use it to define a height on E(k). 


Definition. Let K = k(C) be the function field of a non-singular algebraic 
curve C/k. The height of an element f € K is defined to be the degree of 
the associated map from C to P?, 


h(f) = deg(f : C > P). 


In particular, if f € k, then the map is constant and we set h(f) = 0. 
Let £/K be an elliptic curve given by a Weierstrass equation 


y? +a, cy + a3y = 2° + agx? + aga + a6. 
The height of a point P € E(K) is defined to be 


Gs. GERESO, 
BES we if P=): 


(Note that h(f) is really the height relative to the field A, and h(P) depends 
on the choice of a Weierstrass equation for EF, although our notation does 
not reflect this. See [AEC VIII §5].) 


Remark 4.1. For each t € C, let 
ord; : k(C)* — Z 


be the normalized valuation on k(C) [AEC II §1]; that is, ord;(f) is the 
order of vanishing of the function f at the point t. Then [AEC II.2.6a] 
implies that 


h(f) = deg f = $~ max{ord:(f),0} = 5 > max{—ord;(f), 0}. 


tec tec 


This definition of the height as a sum of local values is analogous to the 
definition of the height for number fields described in [AEC VIII §5]. Notice 
that we can count either the total number of zeros of f or the total number 
of poles. 
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The properties that we want the height to possess are of two very 
different sorts. First, we want the height to satisfy certain transformation 
properties relative to the group law on E. We will be able to prove this 
below by a straightforward calculation that is very similar to the proof in 
the number field case. Indeed, there is a theory of height functions for 
a wide class of fields which includes number fields and function fields as 
special cases. (See Lang [4] for details.) 

The second property we require of the height is a finiteness property, 
namely that a set of bounded height should contain only finitely many 
points. In the case of a number field K, we first showed that this was true 
of K itself; that is, a number field contains only finitely many elements of 
bounded height [AEC VII.5.11]. This immediately implied the same result 
for E(K). However, matters are more complicated for function fields, since 
a function field may have infinitely many elements of bounded height. For 
example, the elements of height 0 in k(T’) are precisely the elements of the 
field k. More generally, the elements of height at most d in k(T) are the 
rational functions of the form 


ag + aT + agT? +--+ + agT4 
bo + b1T + boT? +--+ + baT? © 


Thus it is not clear whether F(A’) might possess infinite subsets of bounded 
height. We will postpone further discussion of this question until the next 
section. 

The following proposition summarizes the principal geometric trans- 
formation properties of the height. 


Theorem 4.2. Let E/K be an elliptic curve defined over a function 
field Kk. 

(a) h(2P) = 4h(P) + O(1) for all P € E(K). 

(b) h(P + Q)+h(P — Q) = 2A(P) + 2h(Q) + O(1) for all P,Q € E(K). 
(The O(1) bounds depend on the curve E and can be given explicitly; see 
exercise 3.11.) 


Remark 4.2.1. We will prove Theorem 4.2 using the triangle inequality 
and elementary polynomial computations. Later we will give another proof 
using intersection theory on a non-singular model for the elliptic surface €; 
see (9.3). 


PROOF (of Theorem 4.2). Fix a Weierstrass equation for F of the form 
E:y=234Ar+B. 


For any t € C, we write ord;(f) as usual for the order of vanishing of f € 
k(C) at t. To ease notation and avoid the inevitable confusion caused by 
repeated minus signs, we will also write 


w+(f) = (order of the pole of f at t) = — ord, (f). 
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Note that the (non-archimedean) triangle inequality for 7; has the form 
mf +f") <max{m(f),m(f")}, with equality unless m(f) = m(f’). 


(a) Let P = (x,y) € E(K). The duplication formula [AEC III.2.3d] says 
that z(2P) = ¢/w, where 


} = O(a) = 21 —2Az*—8Br+ A? and w= (2) = 407+ 4Ar+4B. 
We compute 
h(2P) =h(x(2P)) definition of height 
= S> max {m(¢/v), 0} from (4.4) 


tEeC 
a $5 max {m(¢), m(v)} since So mv) = 0. 
tEC tEC 


So we need to show that max{m(¢),7()} and 4max{m(zx),0} are ap- 
proximately equal. If x has a large order pole at ¢t, this is fairly clear, 
since ¢ = «+ + --- will then have a pole four times larger than z. On 
the other hand, if 2 does not have a large pole, we will be in good shape 
provided ¢ and ~ don’t both have large order zeros at ¢. In order to make 
these vague comments precise, we define a quantity 


e 1 1 
Me = Lt ( EB) det max { F(A), sm(B).0} 


and consider the following two cases. 


[ae() > pe (x has a large pole at t)| 

The definition of 4; shows that in this case we have strict inequalities 
m(x*) > max{m;(2Ax*), m:(8Bx), ™(A?)}, 
m(a°) > max{m,(Az), 7,(B)}. 


It follows from the triangle inequality that 7(¢) = m:(x4) and m(w) = 
m(x). We also have 7;(z) > p > 0, which proves that 


max{m,(¢), 7 (v)} = m(x*) = 4max{m(z), 0}. 


[re(a) < p+ (ax has a small pole at t)| 


The triangle inequality gives us trivial upper bounds 


THO) < max{m;(x*), 7(2Ax*), ™(8Bax),7(A7)} 
< max {4pir, m(A) + Que, me(B) + pr, 274(A) } 


< Au since 7(A) < 2uz and 7(B) < 32, 
and 
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m™m(w) < max{7;(x*), m(Azx), ™(B)} 
< max{ 3p, (A) + pe, 7(B) } 
< Sut. 


Hence 
max{7(@), me(~) } < 4p < Spe + 4max{m;(zx),0}. 


In order to get a lower bound, we need to know that ¢ and w cannot 
both vanish to high order at t. We define functions 


® = 1227 + 16A, WU = 32° — 5 Ax — 27B, A = 4A? + 27B?, 
and observe that there is an identity 
®-6—-VU-w=4. 


(We’ve used this identity many times before, for example [AEC VIII.4.3].) 
Note that the discriminant A = 443 + 27B? € k(C) is not identically zero, 
since £'/k(C) is assumed to be non-singular. Intuitively, our assumption 
that m:(x) < pf implies that @ and UW have bounded poles at t, and then 
the identity says that ¢ and w cannot both have high order zeros at t. More 
precisely, we start with the upper bounds 


m(®) < max{m(12x7), m(16.A)} < yee, 
m(W) < max{m(3a*), m(5Azx), m(27B)} < 3yt. 


Next, using the above identity, we find that 


ord;(A) = ord;(@¢ — Wy) > min{ord,(¢), ordy(w)} — max{m(®), m(V)} 
> min {ord;(@), ord,(w) } — 3. 


Now multiplying by —1 and using — ord; = 7; yields 


max{m,(d), 7:(w)} > —3py, — ord, (A) 
> —3y — ord;(A) — 4 (ue — max{7;(x), 0}) 
since 1, > max{7;(x), 0} 
= —7, — ord;(A) + 4max{7;(zx), 0}. 


Combining the upper and lower bounds in this case, we obtain 
—T pe — ordy(A) < max{m(d), m(wW)} — 4max{m (x), 0} < dye. 
In both cases we have now proven bounds of the form 


c1(t) < max{m(¢), m(W)} — 4max{m(zx),0} < c(t), 
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where the quantities c)(t) and co(t) have the property that they are inde- 
pendent of the point P = (x,y) and are equal to zero for all but finitely 
many t € C. Summing this inequality over t € C gives the desired estimate 


C1 < h(2P) a 4h(P) < c2 
for certain constants c; = c;(Z) which do not depend on P. 


(b) If P= O or Q = O, the assertion is trivial, and if P = +Q, then (b) 
reduces to (a). So we will assume that P,Q 4 O and P #+Q. We write 


P= (t1,y1), Q = (xo, y2)s P+ Q = (x3, ys), P— Q = (4, ya). 


The condition that P # +Q ensures that the coordinates are all finite. The 
addition formula [AEC III.2.3d] on the elliptic curve gives 


2 
= A 2B—2 
n= (2=*) Sick pes Wal mito uta) 2B S Siva 


tq- 2 (xq — @)? 

yoty \? (A+21%2)(21 + 2) +2B + 2yryo 
L4 = Ly w2 = 2 e 

I2—- 24 (x2 — 21) 


Next we compute 
A(P + Q) + h(P — Q) = h(x3) + A(x) 
= > max {m:(x3), 0} + max{m(zx4),0} 


tec 
< So max{m:(ar324), (x3 + v4), 0}, 
tec 


where the last inequality needs some justification. In fact, for any func- 
tions a, b,c,d € k(C)* we have 


max{7;(a), 7¢(b) } + max {7,(c), m:(d)} = max{7;(ac), ™(ad + bc), m(bd) }. 


This is easily verified using the triangle inequality and checking the various 
cases. For reasons which will become apparent in a moment, we will add 
0 = > m((a1 — 22)”) to both sides of this inequality, which yields 


W(P +Q) +h(P — Q) < })max{m ((t1 — 22)?ars4), 
tec 


my ((x1 — @2)?(x3 + @4)), 7 ((w1 — £2)”)}. 


Next we use a little algebra and the fact that the points P; and P, lie 
on & to compute 


(a, — ©2)?2304 = (a122 — A)? — 4B(xz1 +22), 


(x, — £2)" (x3 +24) = 2(21 + 22)(A+ 2122) + 4B. 
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Substituting in above and using the triangle inequality yields 


h(P + Q)+h(P -Q) 
x So max {mr (2122 = A)? —4B(a,+ £2)), 


ee m,(2(@1 + @2)(A + 2142) + 4B), m7 ((a1 = z2)?)} 
< S> max {mm (2723), me(Arix2), 7:(A?), m™(Bx1), (Baa), 
tec 


m:(Ax1), (Axe), me (2722), me(2123), 
me(B), me(x?), we(L122), Te(@3) } 
< Si(2 max{m;(z1),0} + 2max{7;(x2),0} 
te +2 max{7;(A),0} + max{m(B),0}) 
= 2h(x1) + 2h(x2) + 2h(A) + h(B) 
= 2h(P) + 2h(Q) + O(1). 


It remains to prove an inequality in the opposite direction. It is possi- 
ble to do this directly, as is done, for example, in [AEC VIII.6.2]. But we 
will instead use the following clever trick which is due to Don Zagier. We 
have proven that the inequality 


2h(P) + 2h(Q) > h(P + Q) +h(P — Q)+ O(1) 


holds for all P,Q € E(K). Given two points P’,Q’ € E(K), we apply this 
identity with P = P’+ Q’ and Q = P’ — Q’ and then use (a) to obtain 


2h(P’ + Q’) + 2h(P’ — Q’) > h(2P’) + h(2Q’) + O(1) 
= 4h(P’) + 4h(Q’) + O(1). 


Dividing by 2 gives the opposite inequality, which completes the proof 
of (b). Oo 


Just as in the number field case, we can construct a canonical height 
which is a quadratic form on the group E(K). However, in order to prove 
that the canonical height is non-degenerate, we need to know that sets of 
bounded height are finite. The exact conditions under which this occurs 
will be described in the next section. We have included the construction of 
the canonical height here, since it seems to fit in better with the material 
in this section. We hope the reader will excuse this textual non-linearity. 


Theorem 4.3. Let E/K be an elliptic curve defined over a function 
field K = k(C). 
(a) For every point P € E(K), the limit 


* 1 1 
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exists. The quantity h(P) is called the canonical (or Néron-Tate) height 
of P. 
(b) The canonical height has the following properties: 
(i) h(P) = $h(P) + O(1) for all P € E(K). 

(ii) hA(mP) = m?h(P) for all P € E(K) and all m € Z. 

(iii) A(P + Q) + A(P — Q) = 2h(P) + 2A(Q) for all P,Q € E(K). 
(c) The canonical height is a quadratic form on E(K). In other words, 
h(—P) = h(P), and the pairing 

(., +): E(K) x B(K) > R 
(P,Q) = h(P + Q) — h(P) — h(Q) 

is bilinear. (N.B. The pairing is normalized so that h(P) = 3(P, P).) 
(d) Assume that E does not split over k. (This means that E is not K- 


isomorphic to an elliptic curve defined over k. See §5 for more details.) 
Then h(P) > 0, and 


h(P) =0 if and only if P is a point of finite order. 
(e) Any function E(K) — R which satisfies (b)(i) and (b)(ii) for some 
integer m > 2 is equal to the canonical height. 


PROOF. We will just briefly sketch the proof, since it is exactly the same 
as in the number field case [AEC VIII.9.1, VIII.9.3]. For any integers n > 
m > 0 we have 


n-1 
|4-"A(2"P) — 4-™A(2"™P)| = De 4-1 p(git py — 4-‘n(2'P)| 
o, 
< S047" |h(2- 24) — 4h(2‘P)| 
a 
< SS 4~*1O(1) from (4.2a) 
=o 4""100) S04). 


This shows that the sequence 47" h(2" P) is Cauchy, hence converges, which 
proves (a). Further, taking m = 0 and letting n — oo gives 


|2h(P) — h(P)| < O(1), 


which is (b)(i). 
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Next we apply (4.2b) to the points 2”P and 2"Q to obtain 
h(2"(P + Q)) + h(2"(P — Q)) = 2h(2”P) + 2h(2"Q) + O(1). 


Dividing by 4” and letting n — oo gives (b)(iii). Evaluating (b)(iii) at P = 
m@ yields the identity 


A((m+1)Q) +h((m — 1)Q) = 2h(mQ) + 2h(Q). 


Taking m = 0 gives h(—Q) = A(Q), and then an easy induction (up and 
down) on m gives (b)(ii). This completes the proof of (b). 

It is a standard computation to show that a function satisfying the 
parallelogram law (b)(iii) is a quadratic form; see for example the proof 
of [AEC VIII.9.3c]. This gives (c). 

If P € E(K) has finite order, then 2”P takes on only finitely many 
values, so it is obvious from the definition that h(P) = 0. Conversely, 
suppose that h(P) = 0. Then for all m € Z we use (b)(ii) and (b)(i) to 
compute 


h(mP) = 2h(mP) + O(1) = 2m?h(P) + O(1) = O(1). 


It follows that {mP : m € Z} is a set of bounded height. We will prove in 
the next section (5.1) that if E does not split over k, then sets of bounded 
height are finite. Hence P is a point of finite order, which completes the 
proof of (d). 

Finally, suppose that g : E(k) — R satisfies (b)(i) and (b)(ii) for 
some m > 2. Then we compute for every P € E(K) and every i > 1, 


2h(P) — 2g(P) = m-*! (2h(m' P) — 2g(m' P)) 
=m~*'((h(m' P) + O(1)) — (h(m'P) + O(1))) 
= O(m-**), 


Letting i — oo shows that h(P) = G(P). Oo 


Remark 4.3.1. It is also possible to construct the canonical height using 
intersection theory on the minimal elliptic surface associated to E. See §9 
for details, especially (9.3). One consequence of the geometric construction, 
which is not at all evident from the definition, is that for function fields 
the canonical height h(P) is always a rational number. This is (probably) 
false for number fields, where it is conjectured that h(P) is transcendental 
for all non-torsion points. 
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85. Split Elliptic Surfaces and Sets of Bounded Height 


We proved in the last section that the height function h : E(K) — R 
behaves nicely with respect to the group law on E. In order to prove 
that E(K) is finitely generated, it remains to show that sets of bounded 
height in E(A) are necessarily finite. The reader will recall that in the 
case of number fields, this was comparatively easy to do. Unfortunately, 
for function fields it is easy to construct a counterexample to this assertion! 

For example, let Eo /k be an elliptic curve, let E = Eg xC be the elliptic 
surface with € — C being projection onto the second factor, and let E/K 
be the corresponding elliptic curve over K. Then every point y € Eo(k) 
gives a section 


oy: C—E=E)xC, o~(t) = (7, ¢), 


and this section corresponds to a point P, € E(k). Clearly, distinct 7’s 
give distinct P,’s, and just as clearly the map 


Eo(k) > E(K), y+ > Py, 


is a homomorphism. It follows that E(K) cannot possibly be finitely gen- 
erated, since the fact that k is algebraically closed means that Eo(k) is not 
finitely generated. (For example, if k = C, then E(k) = C/A for some 
lattice A C C.) 

It will turn out that this is the only way in which E(K) can fail to be 
finitely generated, which suggests that we make the following definition. 


Definition. An elliptic surface € — C splits (over k) if there is an elliptic 
curve Eg/k and a birational isomorphism 


i:€  EyxC 
such that the following diagram commutes: 
E — Ey x C 
ON LY Proje 
C 


There are several other ways of characterizing split elliptic surfaces. 
The following one will be used later in this section. For others, see exer- 
cises 3.9 and 3.10. 


Proposition 5.1. Let € —C be an elliptic surface over k, and let E/K 
be the corresponding elliptic curve over the function field K = k(C). The 
following are equivalent: 
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(i) The elliptic surface € — C splits over k. 
(ii) There is an elliptic curve Eo/k and an isomorphism E —> Eo defined 
over K. 


PROOF. Suppose first that 7: € — C splits. This means that there is a 
birational isomorphism i: € — Ep x C so that proj, oi = 7. A dominant 
rational map induces a corresponding map on function fields (3.7), so we 
obtain an isomorphism k(€) & k(Eo x C) which is compatible with the 
inclusions 

k(C) — k(E) and k(C) — k(Eo x C). 


In other words, if we let K = k(C) as usual, then the fields k(€) = K(E) 
and k(Eo x C) = K(£o) are isomorphic as K-algebras. Each of them is 
a field of transcendence 1 over A, so each corresponds to a unique non- 
singular curve defined over K (see [AEC, II.2.5] or Hartshorne [1, 1.6.12]). 
In other words, there is an isomorphism EF & Eg defined over K. This 
completes the proof that (i) implies (ii). 

Conversely, suppose that we are given an elliptic curve Eo/k and an 
isomorphism E —> Ep defined over K. Then K(E) & K(Eo) as K- 
algebras, which is the same as saying that k(E) © k(Eo x C) as k(C)- 
algebras. Again using (3.7), this isomorphism of fields induces a birational 
isomorphism of varieties € > Ey x C commuting with the maps to C, which 
shows that € — C is split over k. Hence (ii) implies (i), which completes 
the proof of (5.1). oO 


Example 5.2. Note that the isomorphism in (5.1lii) is not required to be 
defined over the constant field k. In fact, since E is only defined over Kk, 
it really only makes sense to talk about maps being defined over K . For 
example, take C = P! and K = k(T), and consider the elliptic surfaces 


fy Sa? 1, Eo:yi =x? +7, 
€g3:y%=2°4+T, Eg:y® =e 4+a4T. 
Also let Eo/k be the elliptic curve 
Eo: y =x? +1. 


Then €, is clearly split over k, since it is precisely Ey x C. The surface €2 
also splits over k, as can be seen from the isomorphism 


Eg = Eo x C” ((a, y), t) aad ((t-*2, ty), t). 


The elliptic surface €3 does not split over k, although it will split if we 
replace the base field k(T) by the larger field k(T1/°). Finally, €4 does not 
split over k; and since its j-invariant is non-constant, it will still not split 
even if we replace k(T’) by a finite extension. See exercises 3.9 and 3.10 for 
general statements. 
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Remark 5.3. For a number field K, it was not hard to show that there are 
only finitely many elements of K having bounded height [AEC VIII.5.11], 
which immediately gave the same result for E(A). Unfortunately, this 
assertion is clearly false for function fields, since there may be infinitely 
many maps C — P! of any given degree. In other words, for number fields 
we proved that a set of bounded height 


{P € E(K) : h(P) < d} 


is finite by reducing to an assertion about elements of bounded height in K. 
But for function fields we will need a new sort of argument which makes 
use of the fact that the coordinates of P = (x,y) satisfy the equation of 
an elliptic curve. Further, we need to rule out split elliptic curves, since 
they will have infinitely many points of bounded height. All of this will 
help to explain why the proof of the following result is far from trivial and 
requires techniques different from those used when studying number fields. 
The proof will take us the rest of this section. 


Theorem 5.4. Let € — C be an elliptic surface over an algebraically 
closed field k, let E/K be the corresponding elliptic curve over the function 
field K = k(C), and let d be a constant. If the set 


{Pé€ E(K) : h(P) < d} 
contains infinitely many points, then € splits over k. 


PROOF (of Theorem 5.4). We will divide the proof of Theorem 5.4 into 
two steps. The first step says that if € has infinitely many sections of 
bounded degree (i.e., E(K) has infinitely many points of bounded height), 
then there is a one-parameter family of such sections. The second step says 
that if there is a one-parameter family, then € splits. We begin with the 
existence of the family. 


Proposition 5.5. Under the assumptions of Theorem 5.4, there is a (non- 
singular projective) curve [/k and a dominant rational map ¢:T x C > € 
such that the following diagram commutes: 


[xc wes & 
proj2 \, “on 
C 


PROOF. We fix a Weierstrass equation for E/K of the form 


E:y=2°+Ar+B with A,Be kK =k(C), 
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and we define a set 


def 


E(K,d) © {Pe E(K) : h(P) <a}. 


Our assumption is that E(K, d) is infinite, and we wish to use this fact to 
find an appropriate curve [ and map T x C — € 
The first step is to parametrize the set of maps from C to P?. Recall 
that if D is a divisor on C, then L(D) is defined to be the space of rational 
functions 
L(D) = {f € k(C) : div(f) + D > 0}. 


This is a finite dimensional vector space whose dimension is denoted by 
€(D). (For basic facts about L(D), see [AEC, II §5].) Taking three functions 
from L(D) will define a map from C to P?, so we get a natural map 


L(D)3 ~ {0} —> Map(C, P?), 
(Fo, Fi, F2) > (t+ [Fo(t), Fi(t), Fa(e))). 


Multiplying (Fo, F,, Fo) by a scalar clearly gives the same map C — P?, so 
we actually have an association 


pe(P)-1 ~ aes {O} Ss Map(C, P?). 

The key here is that we have taken a collection of maps in Map(C, P?) 
and have parametrized this collection using the points of the algebraic 
variety P°—!, where to ease notation we will write @ for @(D). Some of these 
maps C — P? will actually correspond to elements of E(K). The next step 
is to show that the maps corresponding to E(K) form an algebraic subset 
OE Pet, 

For simplicity, we will assume henceforth that D > 0, and we fix a 
basis fi,..., fe for L(D). Further, we choose a divisor D’ > 3D large 
enough so that 1,A4,B € L(D’ — 3D), and we let hi,...,h, be a basis 
for L(D’). 

Every element in L(D)? can be written uniquely in the form 


e e e 
F = (Fy, Fo, Fe) = Ox afr bifis cfs): 
i=l Pm -1 


Such an F' will give an element of E(/) if and only if Fy, Fy, F. satisfy the 
homogeneous equation of E, 


FeF, = F3 + AF, F? + BF®, 


In other words, F will give an element of E(K) if 
se bf) o3 cifi) = o3 afi) A (2 aifi) Os ccf) 
3 
+B 63 cifi) . 
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Multiplying this out gives a sum involving monomials of the form fi fj fi 
and Afifjfe and Bfif; fe. Our choice of D’ ensures that each of these 
monomials is in L(D’), hence can be written uniquely as a linear com- 
bination of hy,...,h,. So finally we end up with an equation that looks 
like 


r 
S> G;(a, b, c)hi = 0, 
i=1 
where each ®; is a homogeneous polynomial in the coordinates 
[ay by) = [egy cine, bry, be, Cirecs eh Pt. 


Now the maps C’ — P? from above which correspond to elements of E(K) 
are associated to the points of the variety 


Vp © {[a,b,¢] ¢ P“)-! : (a,b,c) =0 for all 1 <i <r}. 


Example 5.5.1. We briefly interrupt the proof of Proposition 5.5 to pre- 
sent an example. If we take C = P! and K = k(T), then E/K has a 
Weierstrass equation of the form 


yr =a?+A(T)e+B(T) with A,B Ee &{T). 
We let D = n(oo), which means that L(D) is the set of polynomials in &[T] 


of degree at most n. (Here @(D) = n+ 1.) The corresponding family of 
maps P! — P? is parametrized by P?"+? as described above, 


parte = Map(P!, P?), 
n n n 
[@0,---;@n,00,---;8n;Co++-3Cn] Dar ye Wat |, 
i=0 i=0 i=0 


Writing A(T) = 5° A;T’ and B(T) = >> B;T", this map P! — P? will give 
an element of E(K) if it satisfies the equation 


(ar) (Con) = (Car | 
zs os A,T") os aT") Oe er"). & ox B,T') 3 ciT") 


Multiplying this out and writing it as a polynomial in T gives a formula 
that looks like 


S> G;(a, b, c)T" = 0, 
i=0 


and the system of homogeneous equations ®yg = ©; = --- = ©, = 0 defines 
the variety Vp. 
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To illustrate how this procedure works in practice, we will consider the 
elliptic curve 
E:y=23+T?2-1. 


We will find all points of degree at most 1 by setting 
P= [ao + a1T, bp + b1T, co + eT] 


and substituting P into the (homogenized) equation for E. Multiplying 
everything out, we obtain the equation 


a,c2T? + (2aycoe, + agc?)T* + (a? + arc? — b2e, + 2agcocr — c2)T* 
+ (3aga? — b2e9 + agcz — 2bobic, — 3coc?)T? 
+ (3agai — 2bobic9 — beer a 3c5c1)T + (a3 ce béco = ©) =0. 
Setting the coefficients equal to 0 gives six homogeneous equations for the 


six variables ag,...,¢1. These six equations define the variety Vp C P®. 
After some work one finds that Vp consists of the 3 lines 


{[0, 0, u,v, 0, 0]} U {[0, 0, ix, iv, u,v] } U {[0,0, —iu, —év, u,v] } 
and the 22 isolated points 


{[0, 61, C2, 0, 1, 0] : Ge = CG = -1} U {[p?, 0,0, p, 1, 0] . p° = 1} 
U {[-2p7, ¢, 3¢p*, 2p, 1, 0] : C =—-l, p® = 1}. 


Note that although Vp itself is not zero-dimensional, its image in E(K) 
consists of a finite set of points, namely 


(0,1,0], [0,¢,1], [G7,¢2,1],  [9?,eT,1], [-26? + CT, 3¢p? + 2pT, 1], 


where ¢, (1, C2, and p satisfy ¢? = (2? = G2? = —1 and p® = 1. For 
example, all of the points on the line [0,0,iu,iv, u,v] in Vp are mapped 
to the single point [0,7u + ivT, u + vT] = [0,1,1] € E(K). We also observe 
that of these 24 points in E(K), only the 3 points [0, 1,0] and [1, +7, 1] are 
in E(Q(T)). 

We now resume the (regularly scheduled) proof of Proposition 5.5. 
Recall that we have constructed a variety Vp and a map Vp — E(K). The 


following lemma shows that if deg(D) is sufficiently large, then the image 
of Vp in E(K) will contain E(K, d). 


Lemma 5.5.2. Let g be the genus of the curve C’. If 


1 
degD>g+t+ ad+ 3 (deg A + deg B), 
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then the image of Vp in E(K) contains E(K,d). (For a more accurate 
estimate, see exercise 3.13.) 


PROOF (of Lemma 5.5.2). This is an exercise using the Riemann-Roch 
theorem. Let P = (xp, yp) € E(K, 4d), so, by definition, deg(zp) = h(P) < 


d. Using the Weierstrass equation for E’, we can check that deg(yp) is also 
bounded, 


2deg(yp) = deg(x}, + Arp + B) < 3deg(xp) + deg(A) + deg(B), 
3 1 : 
deg(yp) < 5a + 5 (deg(A) + deg(B)). 


In order to prove the lemma, we need to find functions Fo, Fi, Fh € L(D) 
such that (tp,yp) = (Fo/ Fo, PF, /F9). 

Recall that any function f € K defines a map f : C — P!, and the 
divisor of f has the form 


div(f) = divo(f) — divoo(f) = f*((0)) + f*((co)), 


where divo(f) and div..(f) are the divisors of zeros and poles of f respec- 
tively. (See [AEC II.3.5].) We also note from [AEC, II.3.6(a)] that 


deg(f) = deg (divo(f)) = deg (divoo(f)). 
We are going to apply the Riemann-Roch theorem to the divisor 


DY” ag D- div..(xp) = div.(yp), 


whose degree we estimate as 
deg(D") = deg(D) — deg(xp) — deg(yp) 
> deg(D) —d— (5a+ 5 (deg( A) + dex(B)) J > g. 
The Riemann-Roch theorem [AEC, II.5.4] then tells us that 
iD") > deg(D") ~g +12 1; 
so there exists a non-zero function F € L(D”). We claim that the three 


functions 
fo = Frp, F, = Fyp, Fy =F, 


are all in L(D), which will complete the proof of the lemma. 
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To check this last assertion, we use the fact that div(F) + D” > 0 and 
compute 


div(Fzp) + D = div(Fzp) + D” + divao(xp) + divoo(yp) 
= div(F) + D” + divo(zp) + divoo(yp) = 0, 
div(Fyp) + D = div(Fyp) + D” + divo(xp) + divoo(yp) 
= div(F) + D” + divo(zp) + divo(yp) > 0, 
div(F) + D = div(F) + D” + divao(xp) + divoo(yp) = 0. 


This completes the proof of Lemma 5.5.2. oO 


Continuing with the proof of Proposition 5.5, we fix a divisor D > 0 
whose degree is large enough so that we can apply (5.5.2). Then (5.5.2) 
and our assumption that E(K,d) is infinite tell us that the image of Vp 
in E(K) is infinite. 

We now change perspective a bit and consider the associated elliptic 
surface € + C. We have assigned to each point y € Vp a point P, € E(K), 
and (3.10c) says that the point P, corresponds to a section 0, :C — €. In 
this way we get a natural rational map 


@:VpxC— €, (y,t) > o,(t). 


It is clear that ¢ is an algebraic map, since using notation from above, we 
see that ¢ can be written as 


o((a, b, cl, t) t) = [ah Ooi fi(t), Desilt)]. 


If there exists an irreducible curve T C Vp such that the map 
o:TxC—-€, (7, t) > o,(t) 


is dominant, then the proof of Proposition 5.5 will be complete. So we 
assume that ¢: Tx C — E is not dominant for every irreducible curve TC 
Vp and derive a contradiction. 

Let [ C Vp be an irreducible curve. We are assuming that 6: [xC — 
€ is not dominant, so the image ¢(I x C) has dimension at most one. 
However, we know that 7(o,(t)) = t, which shows that 7 maps (I x C) 
onto C. It follows that the image ¢([ x C) must have dimension exactly 
one. Further, the product I’ C is irreducible, so @(T x C) is also irreducible 
by (3.5a). Therefore ¢([ x C) must consist of a single irreducible curve. On 
the other hand, for any given y € I the map a, is a section to 7: € — C, 
so we have 


o({y} x C) =0,(C) =C 


228 III. Elliptic Surfaces 


is already an irreducible curve contained in ¢(T' x C). Hence o({7} x C) 
must be equal to ¢(f x C) for every y € I. Equivalently, every y € I gives 
the same section 0, which means that the image of [ in E(K) consists of 
a single point. 

We have now shown that every irreducible curve ! C Vp maps to a 
single point in E(K). But any two points in any connected component 
of Vp can be linked by a connected chain of irreducible curves. (In fact, 
on any irreducible component of Vp, any two points can be connected by 
a single irreducible curve; see exercise 3.14.) Since Vp has only finitely 
many connected components, it follows that the image of Vp in E(K) 
is finite. This contradicts the fact shown above that the image of Vp 
contains E(K,d). Hence there exists an irreducible curve T C Vp with 
the property that ¢: T x C — € is dominant. Replacing I with a non- 
singular model for I’ (see Hartshorne [1, 1.6.11]) completes the proof of 
Proposition 5.5. oO 


The following proposition, taken together with (5.5), completes the 
proof of Theorem 5.4. 


Proposition 5.6. Let 7: € — C be an elliptic surface over k, let T/k be 


a non-singular projective curve, and suppose that there exists a dominant 
rational map ¢:T x C > € so that the following diagram commutes: 


Txc a} € 


proj2 \ L T 
C 
Then € splits. 


Proor. The fact that ¢ is a dominant map of varieties of the same di- 
mension means that there is a non-empty Zariski open subset €° of € over 
which ¢ is a finite map, say of degree m. Let to € C be a point such that 
the fiber €,, is non-singular and such that ¢ is well-defined at every point 
of I x {to}. Note that the set of such to’s is a non-empty Zariski open 
subset of C,, since €; is non-singular for all but finitely many t € C, and ¢ 
is well-defined except at finitely many points of T x C by (3.5b). To ease 
notation, we let Eo = €:,. Note that Eo/k is an elliptic curve. 
We define a map 


wy: €° 4 x Eo x C, 
((z,y),t) > (© o(%s to);t) 
i=l 


where the points y; are determined by the formula 


m 


* ((x,y),t) = So (v5 4). 


w=1 
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In other words, w takes a point on €°, pulls it back by ¢ to get a collection 
of m points on IT’ x C (counted with multiplicity), changes the t-coordinate 
to to to get a collection of points on T x {to}, uses ¢ to push them forward 
to a collection of points on Eo, and finally uses the group law on Ep to add 
them up. 

Note that the map w is a well-defined rational map on €°. This is 
true despite the fact that the definition of 7 involves applying ¢7~', since 
ultimately we take a symmetric expression of the points in 67! ((z, y), t), 
so the resulting point can be expressed as a rational combination of z, y, t. 
This is clearest for those points (Gia), € €° for which o-'((x,y), t) 
consists of m distinct points, which suffices for our purposes since we only 
need to define % on an open subset of £9. 

(Aside: An alternative description of ~ is as follows. Let I”) be 
the m-fold symmetric product of [. Then the map ¢ defines in a natural 
way a morphism from €° to P(™ x C which sends a point in €° to the 
collection of points in its inverse image. Next we use the map ¢(-,to): IT > 
Eo to map 1(™ x CO > Be x C. Finally, the summation map Em — Eo 
using the group law on Ep gets us to Eo x C, and the composition of all 
these maps is w : €° — Ey x C. For information about the symmetric 
product, see Harris [1, Lecture 10, especially 10.23].) 

Note that if ~ were a (birational) isomorphism, we would be done. 
Unfortunately, there is no reason that this should be true. We begin our 
analysis of the map w by computing it on the fiber over to. 


w(x, y), to) aaa ( oa (7, to); to) = (m(a, y), to). 
(y,t')EO* ((x,y),to) 
Thus ~ : €4, — Eo x {to} is just the multiplication-by-m map on Ep. 
In particular, since the multiplication-by-m map is surjective, we see that 
w(E°) contains Eo x {to}. This implies that the rational map € > Ep x C 
is dominant, since otherwise the irreducibility of y(€°) would imply that 
w(E°) = Ep x {to}, contradicting the fact that (€°) maps onto C (ie., 
w(€°) must contain at least one point on each fiber on T x C > C). 

We now consider the elliptic curve E'/K associated to the elliptic sur- 
face €. We also take the elliptic curve Eo/k and think of it as the el- 
liptic curve Eo/K associated to the split elliptic surface Ey x C. Then 
the dominant rational map w : E — Epo x C defined above corresponds 
to a non-constant map E — Ep of elliptic curves over K (3.9). Any such 
map can be written as the composition of a translation followed by an 
isogeny [AEC, III.4.7], so we obtain a non-zero isogeny 


A: E — Eo 


defined over K. Taking the dual isogeny [AEC, III §4] gives a map ES 
E defined over K, and this induces an isomorphism [AEC, ITI.4.11,III.4.12] 


\: Eo/ker(A) oD 
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Now ker(A) is a finite subgroup of Eo(K), so the fact that Eo is defined 
over k implies that ker(\) C Eo(k). To see why this is true, take a Weier- 
strass equation for Eo with coefficients in k. Then the n-torsion points of E 
have coordinates which are roots of certain polynomials having coefficients 
in k, so E[n] C E(k). (For explicit formulas, see [AEC, exercise 3.7]. Note 
we are assuming that & is algebraically closed.) It follows that the elliptic 
curve Fy ae Eo/ ker(A) is defined over k. 

We have now produced an elliptic curve E,/k and an isomorphism of 
elliptic curves E; — EF defined over K. It follows from (5.1) that E'/K splits 
over k. This completes the proof of Proposition 5.6 and, in conjunction with 
Proposition 5.5, also completes the proof of Theorem 5.4. oO 


§6. The Mordell-Weil Theorem for Function Fields 


We have now assembled all of the tools needed to prove the following im- 
portant result. 


Theorem 6.1. (Mordell-Weil Theorem for Function Fields) Let € — C 
be an elliptic surface defined over a field k, and let E/K be the correspond- 
ing elliptic curve over the function field K = k(C). If € — C does not split, 
then E(K) is a finitely generated group. 


PROOF. Suppose first that k is algebraically closed. The weak Mordell- 
Weil theorem (2.1) tells us that the quotient group E(A)/2E(K) is finite. 
Next let h : E(K) — Z be the height function defined in §4. This height 
function satisfies 


(i) h(P + Q) = 2h(P) + 2h(Q) — A(P — Q) + O(1) 

< 2h(P) + Og(1) for all P,Q € E(K), 
(ii) h(2P) = 4h(P) + O(1) for all P € E(k), 
(iii) {P € E(K) : h(P) < C} is finite. 


The first two statements are (4.2a,b), whereas the third statement is (5.4) 
and uses the assumption that € — C does not split. We now have all of the 
hypotheses needed to apply the Descent theorem [AEC, VIII.3.1], which 
completes the proof that E(k) is finitely generated under the assumption 
that k is algebraically closed. Finally, for arbitrary constant fields k, it suf- 
fices to observe that E(K) = E(k(C)) is a subgroup of E(k(C)), so E(K) 
is finitely generated. Oo 


Remark 6.2.1. If € — C splits over k, then the group E(K) need not be 
finitely generated. More precisely, if € & Ey x C, then each point z € Fo(k) 
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is also a point in Eo(K) = E(K). Thus there is an inclusion Eo(k) 
E(K). So for example, if k = C, then Eo(k) will certainly not be finitely 
generated, and the same is then true of E(K). However, the quotient 
group E(K)/Eo(k) will be finitely generated. This relative version of the 
Mordell-Weil theorem is due to Lang and Néron; see exercise 3.15. 


Remark 6.2.2. If k is a number field, or more generally if k is a finitely 
generated extension of Q, then the Mordell-Weil theorem (6.1) is true re- 
gardless of whether or not € — C splits. This generalization of the original 
Mordell-Weil theorem for number fields is due to Néron; see exercise 3.4. 


§7. The Geometry of Algebraic Surfaces 


All of our previous work in this chapter has dealt with the birational ge- 
ometry of elliptic surfaces. In order to investigate the finer structure of 
elliptic surfaces, we will need to study them up to isomorphism. This sec- 
tion reviews the basic theory of non-singular algebraic surfaces, including 
especially intersection theory and minimal models. Our main reference will 
be Chapter 5 of Hartshorne [1], specifically §1 for intersection theory and 
section 5 for the theory of minimal models, although we will also need a 
few additional facts from other sources concerning minimal models. For 
more information about surfaces, the reader might consult Beauville [1] 
and Griffiths-Harris [1, Ch. 4]. 

Let S/k be a non-singular surface defined over an algebraically closed 
field k of characteristic 0. A divisor on S' is a formal sum 


n 
D= ys ajl;, 
i=1 


where a; € Z and the [; C S are irreducible curves lying on the surface S. 
The I;’s that appear in the sum are called the components of the divisor D. 
The group of divisors on S is denoted Div(S). 

Recall that the local ring of S at a point P € S is defined to be 


Os.p ={f €k(S) : f is defined at P}. 
Similarly, for any irreducible curve I C S, the local ring of S at T is 


Osr= { f € k(S) : f is defined at some point P € ey — U Os p. 
Per 


Our assumption that S is non-singular implies that each Og p is a discrete 
valuation ring. We denote its valuation by ordy, since intuitively ordr(f) 
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is the order of vanishing of f along [. We extend this as usual to a homo- 
morphism 
ordr : k(S)* — Z, 
and then use this to define a homomorphism 
div: k(S)* —> Div(S), 
f > SS ordp(f)P. 

rcs 
A divisor is principal if it is the divisor of a function div(f). Two divi- 
sors D,, D2 € Div(S) are linearly equivalent if their difference D, — D2 is 
principal, in which case we write D, ~ Do. Linear equivalence is an equiv- 
alence relation on the divisor group Div(S), and the Picard group of S is 
the corresponding quotient group, 


Pic(S) = Div(S)/~. 


Example 7.1. Consider a divisor D = $>a;I'; € Div(P?) in the projective 
plane. To each irreducible curve I; C P? we can associate its degree, and 
by extending linearly we obtain a sree 


deg: Div(P?) —> 


Soali - Soa an iF 


If D = div(f) is principal, then it is easy to check that deg(D) = 0, so the 
degree map induces a homomorphism 


deg : Pic(P?) — Z. 


We will leave it to you (exercise 3.18) to verify that this last map is an 
isomorphism, so Pic(P?) & Z. This is the analogue for surfaces of [AEC, 
11.3.1b]. It says that a rational function on P? has the “same number” of 
zeros and poles. 


In order to study the geometry of a surface, we will look at the curves it 
contains and how those curves intersect. For example, a curve of degree m 
and a curve of degree n in P? will “usually” intersect in mn distinct points, 
and they will always intersect in mn points if we count tangencies and 
singularities with the correct multiplicities. This famous result is known as 
Bezout’s theorem; see (7.3) below. Our next step is to define an intersection 
index for curves and divisors on arbitrary non-singular surfaces. 

Let Ty and [2 be irreducible curves on S, and let P€ Ty NT2. Fix 
local equations f;, fo € k(S)* for T,, 2 around P; that is, choose f; € Og,p 
so that ordp,(f;) = 1 and ordr(f;,) = 0 for every other irreducible curve P 
containing P. We say that [; and Is intersect transversally at P if fi 
and fz generate the maximal ideal of the local ring Og p. (See exercise 3.19 
for the intuition behind this definition.) 

If fy and Is are irreducible curves that meet everywhere transversally, 
then it is natural to define the intersection [, - Tg to be the number of 
intersection points. The next theorem says that this definition can be 
extended in a natural way to all divisors. 
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Theorem 7.2. There is a unique symmetric bilinear pairing 
Div(X) x Div(X) —= Z, (D1, D2) = dD, . Do, 


with the following two properties: 

(i) If 1, and [2 are irreducible curves on S that meet everywhere transver- 
sally, thenT, -T2 = #([, NT2). 

(ii) If D, Dy, D2 € Div(S) are divisors with D; ~ D2, then D-D, = D-Day. 


ProoF. (Sketch) Given divisors D,, Dz € Div(S'), one uses ampleness and 
a Bertini theorem to find divisors Dj, Di € Div(S) with Dj ~ Di, Dg ~ 
Dg, and such that Dj and Dj are sums of irreducible curves that meet each 
other transversally. Then Dj - D4 is defined using linearity and (i). One 
then checks that the answer is independent of the choice of D{ and D3. 
For details, see Hartshorne [1, V.1.1]. oO 


Example 7.3. We have seen ((7.1) and exercise 3.18) that the degree 
map defines an isomorphism Pic(P?) & Z. Let [y,[2 C P? be curves of 
degrees n1,n2 respectively, and let H,, Hz C P? be (distinct) lines. Then 


deg(T;) =n; = deg(n;H;), which implies that T; ~ ni Hi. 


Further, H, - H2 = 1, since distinct lines in P? intersect transversally in a 
single point, so we can compute 


T; : T> => (n1 Ay) * (n2H2) = nyno( Ay . Hp) EPnyne, = deg(T',) deg(T2). 


The equality Ty -T2 = deg(T,) deg(T2) is called Bezout’s theorem for the 
plane. 


Theorem 7.2 is a powerful existence theorem, but it does not give a 
very practical method for computing the intersection D,- Dz. In principle, 
one can find divisors D{ ~ D, and Di ~ D2 which intersect transversally 
and then count the number of points in Dj, MN D$, but in practice it is 
better to assign multiplicities to the points in D; 1 D2. This is done in the 
following way. 

Let D € Div(S) be a divisor, and let P € S. A local equation for D 
at P is a function f € k(S)* with the property that 


P ¢ D—div(f). 


Notice that if D =T is an irreducible curve, then this is equivalent to the 
condition that ordr(f) = 1 and that ordy/(f) = 0 for all other irreducible 
curves I” containing P. 

Now let D,, Dz € Div(S) be divisors, and let P € S be a point which 
does not lie on a common component of D; and Dz. Choose local equa- 
tions fi, fo € k(S)* for D1, Do respectively. The (local) intersection index 
of D, and Dz at P is defined to be the quantity 


(D; - Dz)p = dimy Og. p/(fi, fa): 
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Notice that (Di; -Dz)p = 0 if P € Di N Dag, since if P ¢ D;, then f; = 1 
will be a local equation for D; at P. The next result explains how the 
local intersection indices can be used to calculate the global intersection 
number Dj, - Do. 


Proposition 7.4. Let D,,D2z € Div(S) be divisors with no common 
components. Then the local intersection index (D, - D2)p is finite for 
all P € S, and 


Di-Dz2= > (Di Da)p. 
PED:ND2 


PRooF. See Hartshorne [1, V.1.4]. Oo 


Example 7.5. The local intersection indices are comparatively easy to 
calculate. As illustration, we will compute the intersection index of 
Pi y¥"ZSk* <ands IpsV¥Z=2° 
in P? at the point P = [0,0,1]. (See Figure 3.1.) We dehomogenize x = 
X/Z, y = Y/Z, so the local ring at P is 
f 
Op.» = Karte = {2 € kau) + 9(0.0) #0}. 


Then 
kx, yl(o,o) kilo kilo 
a a <k 2 3 
(P—ay-2) Ga) sy Tete tke, 


where the middle equality follows from the fact that x —1 is a unit in k[zp. 
Hence 


aoe = dim, (k + ke + ka? + ke) = 4. 


Notice that I, is singular at P, but it has a unique tangent line there which 
is the same as the tangent line to [2 at P. This explains why (I, -T2)p is 
so large. 


Remark 7.6. Proposition 7.4 gives a method for computing the inter- 
section number D, - Dz when the divisors D, and D2 have no common 
components. However, one frequently wants to compute the intersection of 
divisors with components in common. An important example is the self- 
intersection D? = D- D of a divisor D. This cannot be calculated directly 
using (7.4). One approach to calculating D? is to find a D’ ~ D such 
that D and D’ have no common components, and then compute D- D’. 
For example, if ! C P? is a curve of degree n, then the computation in (7.3) 
shows that [? = n?. The argument in (7.3) works for self-intersections be- 
cause T ~ nH for any line H Cc P?. In general, it may be difficult to find 
an appropriate D’. Another approach to computing self-intersections is to 
use the adjunction formula; see Hartshorne [1, V.1.5}. 
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The Curves T, : y? = 2° and Tz: y = x? Near P = (0,0) 
Figure 3.1 


If C is a (possibly singular) curve, then there is a unique non-singular 
projective curve C’ such that C is birationally equivalent to C’. (See 
[AEC, II.2.5] or Hartshorne [{1, I.6.11,1.6.12].) The situation for surfaces 
is more complicated. It is true that every surface is birationally equivalent 
to a non-singular projective surface. However, if S is a non-singular pro- 
jective surface, we can always blow up a point P € S to produce a new 
non-singular surface S’ that is birationally equivalent to S but not isomor- 
phic to S. The blown-up surface has the property that there is a birational 
morphism S' — S; that is, the map S’ > S is a morphism, and it has an 
inverse S — S’ that is a rational map. This leads us to make the following 
definition. 


Definition. A surface S is relatively minimal if it has the following two 
properties: 
(i) S is a non-singular projective surface. 
(ii) If S’ is another non-singular projective surface, and if 6: S — S’ 
is a birational morphism, then ¢ is an isomorphism. 


Theorem 7.7. Every surface So is birationally equivalent to a relatively 
minimal surface S. If the original surface So is non-singular, then there is 
a birational morphism So > S. 


PROOF. This theorem is really an amalgamation of two important results 
in the theory of algebraic surfaces. First, resolution of singularities tells 
us that every surface is birationally equivalent to a non-singular projective 
surface. See Hartshorne [1, V.3.8.1] for a discussion of resolution and a 
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history of its proof. Second, every non-singular projective surface is bira- 
tionally equivalent to a relatively minimal surface (Hartshorne [1, V.5.8]). 

We will not prove either of these results but will content ourselves with 
a few brief remarks. In order to prove resolution of singularities, one starts 
with the surface Sg and continually blows up points and curves until all of 
the singularities disappear. Of course, the hard part is to show that after 
each blow-up, the singularities have become quantitatively better. 

A curve C C Sp is called exceptional if C ¥ P! and C? = —1. Castel- 
nuovo’s criterion (Hartshorne [1, V.5.7]) says that if C is an exceptional 
curve on So, then there is a non-singular surface S; and a birational mor- 
phism @ : So — S) with the property that ¢ is an isomorphism away from C' 
and ¢ sends C to a point P; € S;. We say that ¢ is the blow-down of the 
curve C’, since ¢ is the blow-up of S; at P,. If S; has any exceptional 
curves, we choose one and blow it down. Continuing in this fashion, we 
obtain a sequence of surfaces So, .S,,S2,.... The hard part is to show that 
this process terminates with a non-singular surface S' which contains no ex- 
ceptional curves. Then one shows that such a surface is relatively minimal. 
See Hartshorne [1, V.5.8] for details. This also proves the last part of (7.7), 
since the blow-down maps Sg — S; — So — --- are all morphisms. oO 
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An elliptic surface 7: € > C is an example of a fibered surface; that is, a 
surface that is described as a collection of fibers €; = 7~!(t) parametrized 
by the points t of a curve C. Other examples of fibered surfaces include 
ruled surfaces, which are surfaces whose fibers are all isomorphic to P! 
(Hartshorne [1, V §2]), and products Cy, x C2, which can be made into 
fibered surfaces in two ways by using the projections onto either the first 
or second factor. In this section, we will prove some geometric properties 
that are true for all fibered surfaces. In subsequent sections, we will apply 
these results to our study of elliptic surfaces. 


Definition. A fibered surface is a non-singular projective surface S', a non- 
singular curve C, and a surjective morphism 7: S — C. For any t € C, 
the fiber of S lying over t is the curve S; = 77 1'(t). Note that S; will be a 
non-singular curve for all but finitely many t € S. 


The irreducible divisors on a fibered surface naturally divide into two 
different sorts, those that lie in a single fiber and those that cover C’. More 
precisely, let I C S be an irreducible curve lying on a fibered surface 7 : 
S —C. Then 7 induces a map of curves 7: T — C that is either constant 
or surjective [AEC, II.2.3]. If it is constant, say a([) = {t}, then T lies 
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\ n 
C 
A Fibered Surface with Horizontal and Fibral Curves 
Figure 3.2 


entirely in the fiber S;, and we call T fibral. If not, then 7: T - Cisa 
finite map of positive degree, and we call T horizontal. See Figure 3.2. 


Definition. A divisor D € Div(S) on a fibered surface S is called fibral if 
all of its components are fibral. D is called horizontal if all of its components 
are horizontal. Note that every divisor can be uniquely written as the sum 
of a horizontal divisor and a fibral divisor, since every irreducible curve is 
either horizontal or fibral. 


Let 7: S — C be a fibered surface. If t € C, then the components 
of x—1(t) are irreducible fibral divisors. We assign multiplicities to these 
components in the following way. Let u, € k(C) be a uniformizer at t, that 
is, ord;(w) = 1. Then u; o 7 is a function on S, so we can take its divisor, 
or more precisely that part of its divisor lying in the fiber S;. Extending 
linearly, this gives us a homomorphism from Div(C) to Div(S). 


Definition. Let 7: S — C be a fibered surface, and for each t € C, fix a 
uniformizer u; € k(C) at t. We define a homomorphism 


n* : Div(C) — Div(S), 


> nt(t) > > ne » ordp (uz o 7)T, 


tEeC 1eG TCS; 


where the inner sum on the right is over all irreducible curves [ contained in 
the fiber S, = 7~1(t). It is easy to see that 1* is independent of the choice 
of uniformizers uz, since if ui) is another uniformizer, then (u;/u/)(t) 4 0, co 
at t. Hence (u;/u}) om is not identically 0 or co on any component of S:, 
so ordp(u; 0 7) = ordp(uj o 77). 
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It is clear from the definition of m* that the divisors in 1*(Div(C)) 
are fibral. We begin by showing that they have trivial intersection with all 
other fibral divisors. 


Lemma 8.1. Let 7: S — C bea fibered surface, let 6 € Div(C), and 
let D € Div(S) be a fibral divisor. Then D-1*6 = 0. 


PROOF. Using the linearity of the intersection pairing and the fact that 7* 
is a homomorphism, we may assume that D is an irreducible fibral divisor 
and that 6 = (t) consists of a single point. Then 7(D) consists of one point. 
If that one point is not t, then D and 7*(t) have no points in common, so 
clearly D-7*6 =0. 

We have reduced to the case that 7(D) = {t}. To complete the 
proof, we will move 6 = (t) by a linear equivalence. We can choose a 
non-constant function f € k(C)* by applying the Riemann-Roch theorem 
for curves [AEC, II.5.5c] to the divisor (2g + 1)(t) € Div(C), where g is 
the genus of C. Riemann-Roch then says that £((2g + 1)(t)) = g +2, so in 
particular there exists a non-constant function f whose only poles are at t. 

Let ord;(f) = —n, and consider the divisor 


n*(né + div(f)) = n7n*6 + div(f om). 


The point ¢t does not appear in the divisor né + div(f), so the left-hand 
side has no points in common with S;, and hence it has trivial intersection 
with D. On the other hand, div(f 0 7) is linearly equivalent to 0, so it has 
trivial intersection with every divisor on S. Intersecting both sides with D, 
we find that 


0 = D-n*(né + div(f)) = D- (nx*6 + div(f o7)) =nD- n°. 
Hence D- 1*6 = 0. oO 


We now show that the intersection pairing is negative semi-definite 
when it is restricted to fibral divisors, and we calculate its null space. 


Proposition 8.2. Let: S — C bea fibered surface, and let D € Div(S) 
be a fibral divisor on S. 

(a) D? <0. 

(b) D? = 0 if and only if D € x*(Div(C) @ Q). In other words, D* = 0 
if and only if there is a divisor 6 € Div(C) such that aD = br*é for some 
non-zero integers a,b € Z. 


PROOF. (a) Write D = D,; +---+ D,, where each D; is contained in a 
different fiber. Then D;-D; = 0 for i # j, since they have no points in 
common, which implies that D? = D? +---+ D2. It thus suffices to prove 
the proposition for each D;, so we may assume that D is contained in a 
single fiber, say D C S}. 
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Write 


as a sum of irreducible divisors. It is clear from the definition of 7* that 
the n,’s are all positive. Our assumption that D C S; means that D has 
the form 


T 
D= y al; for some integers a,. 
i=0 


We rewrite D and define another divisor D’ by the formulas 


r Ae 2 
ay a; 

D= ) 7, ales and D’ = 2 ni nals). 
1i=0 = g 


Proposition 8.1 tells us that D’. F = F-D’ = 0. We use this to 
compute 


2D? = D!. F—2D*+F.-D’ 


a? 7 aja; 
= 72 (mili) (nyP5) — 2 S52 (nil i) - (50) 
ij=0 


oy MN; 


The terms with 7 = j in this last sum are zero, so we find that 


Ie a; a; : 
pat (2 | (niVi) - (nj). 


For i # j, the divisors [; and Tj are distinct irreducible divisors, 
so T;-I; > 0. Further, as noted above, the multiplicities no,...,n, are all 
positive, so 

(niT,) -(njTj) > 0 for alli #j. 


This immediately implies that D? < 0, which completes the proof of (a). 
(b) Suppose now that D? = 0. Then the formula for D? shows that 

“= for all i,j such that [;-T; > 0. 

on 15 
In other words, the ratios a;/n; and a;/n,; will be the same if the divi- 
sors I; and I; have a point in common. On the other hand, it is a general 
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fact that the fibers of a fibered surface are connected. This is a special 
case of Hartshorne [1, III.11.3], or see exercise 3.21. So given any two 
components I; and Ij, we can find a sequence of components 

Pee Dis Pas yee splae Ss Ly 
with Ty, -Tj,,, > 0 for all k = 0,1,...,m—1. Hence a;/n; = aj/nj for 
alli and 7. Let a = ag/no € Q be this common ratio. Then 


n 
i=0 ¢ i=0 


i=0 


which completes the proof of Proposition 8.2. Oo 


Remark 8.2.3. With notation as in the proof of (8.2), consider the inci- 
dence matrix 
T= (Ui -Vy)osijsr 


which describes how the components of the fiber S; intersect one another. 
Then (8.2) may be restated as follows: The quadratic form 


QV —Q@ ar ‘ala, 


is negative semi-definite, with one dimensional null space spanned by the 
vector (no,-...,7,-). In particular, det(I) = 0, but every det(J;;) 4 0, 
where J;; is the minor obtained by deleting the i*® row and i*® column of J. 


Next we show that for a large class of divisors it is possible to add on 
a fibral divisor so that the sum will have trivial intersection with all fibral 
divisors. We will use this construction in the next section to describe the 
canonical height in terms of intersection theory. 


Proposition 8.3. Let 7: S — C bea fibered surface, and let D € Div(S) 
be a divisor on S with the property that 


D-x*(t)=0 for some (every) t € C. 
(The quantity D-x*(t) is independent of t; see exercises 3.22 and 3.23 or 
Hartshorne [1, exercise V.1.7].) Then there exists a fibral divisor Bp € 
Div(S) @ Q such that 
(D+@®p)-F =0_ for all fibral divisors F € Div(S). 


If ®', is another divisor with this property, then 


®, — Bp € 1*(Div(C) @Q). 
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In other words, ®p is uniquely determined by D up to divisors that come 
from C. 


PROOF. We are going to try to write ®p in the form }° arT and solve for 
the coefficients ar. More precisely, for every point t € C, write 


Tt 
17* (t) => > neil ti 
i=0 


as a sum of irreducible components. We set azo = O for all t. Further, 
when Tr; > 1 we consider the following system of linear equations: 


rt 
So anlis Ty = -D-Ty, l<jg<re. 
i=l 


Note we are discarding the 0**-component I';9, so this is a system of r; 
equations in the r; variables a,;. Proposition 8.2 says that the incidence 


matrix 
Vee Deg )i<ij<re 


has non-zero determinant (see also (8.2.3)), so this system of equations has 
a unique solution in rational numbers az; € Q. 
We claim that the divisor 


Op = xs s anil ti 


tec i=0 


has the desired property. Note that this is a finite sum, since r; = 0 for all 
but finitely many ¢t, and az = 0. To check that ®p works, it suffices by 
linearity to show that (D+©®p)-F = 0 for every irreducible fibral divisor F’. 
Each irreducible fibral divisor has the form F = [;; for some t € C and 
some 0 < 7 < 7;. We consider three cases. 

First, if ry = 0, then F = Tyo = 1*(t). Using (8.1) and the assumption 
that D-7*(t) = 0, we find that 


(D+ @p)-F=D-r*(t)+®p-n*(t) =0. 


Second, suppose that r; > 1 and F =1T,; with 7 > 1. Then the fact 
that the a;;’s give a solution to the system of linear equations allows us to 
compute 


(D+®p)-F=D-VMyt+ >So aT Ty =0. 


tec i=1 
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c 
t 
A Reducible Fiber on a Fibered Surface 
Figure 3.3 


Finally, we consider that case that r; > 1 and F =T jo. Then 
0=(D+4+@p)-a*(t) from (8.1) 


= iS nti(D + ®p) T:; since m*(t) = bs Neil 
i=0 
=nio(D+®p)-Tro from the previous case. 


This completes the proof that (D + ®p)- F = 0 for all fibral divisors F. 

It remains to show that ®p is unique up to addition of a divisor from C. 
Suppose ®4, is another divisor with the same property. Then for every fibral 
divisor F' we have 


(®, -®>):F =(D+@p)-F-(D+5):-F=0. 


But ®, — ©', is itself fibral, so (®p) — ©)? = 0. It follows from (8.2b) 
that ®, — ®, is in m*(Div(C) @ Q). Oo 


Example 8.3.1. Let 7: S — C be a fibered surface, and suppose that 
the fiber S; consists of four components arranged in the shape of a square 
with transversal intersections, as illustrated in Figure 3.3. In other words, 


1 ift—j =+1(mod 4), 


m(t)=Lo+T, +T2+T3 with eet if i— j = 2(mod 4). 
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We can use (8.1) to compute the self-intersections of the components. 
For example, 


O=VTo-n*(t) =T24+To-Tit+lo-Po+lo-T3 =F§+14+04+1, 


so [2 = —2, and similarly '? = —2 for the other i’s. Thus the incidence 
matrix for this fiber is 


~—2 1 0 1 
1 -2 1 0 
Pas-Tiosiiss={ g 4 2 1 
LQ Bee 


Suppose now that D € Div(S) is a (horizontal) divisor with 
D-To=-1, D-T,=1, D-T2=0, D-T3 =0. 


For example, D might consist of two curves D = D, — Do each of which 
maps isomorphically 7: D; — C, with D, going through T; and Do going 
through To. To find the part of ®p lying over t, call it p+, we take 
®p1 = aly + a2F2 + ass, set (D+@p,)-T; = 0 for 7 = 1, 2,3, and solve 
for the a,’s. Doing this gives 


3 1 1 
®5,=—-T —— -T3. 
Dyt git a ages 


The reader can check that (D+ ®p,;)-T; =0 for 0 <i < 4. Similarly, if D 
were to satisfy 


Devt 2 Ae Py Sa; HeryH ie, ers = 


then : i 
Dp+ = gh + T, + gis 


See exercise 3.24 for a generalization to the case that the fiber is an n-gon 
with transversal intersections. 


The final topic for this section is minimal models of fibered surfaces. 
These will be minimal models which respect the fact that the surface is 
fibered. More precisely, we might say that a fibered surface S — C is 
relatively minimal if for every fibered surface S’ — C, every birational 
map S — S’ commuting with the maps to C is a morphism. In the case 
that the non-singular fibers of S — C have genus at least 1, then it turns 
out that there is a unique relatively minimal model. Further, this model 
will have the stronger minimality property described in the next theorem. 
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Theorem 8.4. Let S — C be a fibered surface with the property that 
its non-singular fibers are curves of genus at least 1. Then there exists 
a fibered surface S™™ — C and a birational morphism ¢ : S — S™™ 
commuting with the maps to C' with the following property: 

Let S’ — C be a fibered surface, and let ¢' : S’ > S' be a birational 
map commuting with the maps to C. Then the rational map ¢o ¢’ extends 
to a morphism. In other words, the top line of the commutative diagram 


, 


Sa e » § Pig Smin 
Se hb. 
C 


extends to a morphism. 


PROOF. The basic idea is as follows. For any given S, let S™™ be obtained 
from S by blowing down all of the exceptional curves on the reducible 
fibers. Next, given an S’ birational to S, take the resulting birational 
map $’"" — 9™n and factor it into the smallest number of quadratic 
transformations 


gm = Sy ~-2 Sy --» Sy --9 ++ --9 S, = Sm 

Then by studying the behavior of the exceptional curves on these quadratic 
transformations, one shows that it is possible eliminate one of the “blow- 
up~blow-down pairs.” In other words, if n > 1, then one shows that $’"™ 
and S™™ are connected by a smaller chain of quadratic transformations. 
Hence S’"" and S™ are isomorphic, which gives the desired result. 

Unfortunately, we do not have at our disposal the tools needed to turn 
this brief sketch into a rigorous proof. We refer the reader to Lichten- 
baum [1, Thm. 4.4] or Shafarevich [2, p. 131] for the complete proof of 
Theorem 8.4. Oo 


Definition. It is clear that the surface S™™ described in (8.4) is uniquely 
determined up to a unique isomorphism commuting with the maps to C. 
A fibered surface S — C is called a minimal fibered surface (over C) if it 
is equal to S™”, 
Corollary 8.4.1. Let «: S —C be a minimal fibered surface over C, 
and let r : S — S be a birational map commuting with the map to C 
(ie, 707 =). Then 7 is a morphism. 
PRoor. By assumption, S$ is minimal, so the map ¢: S > S™ in (8.4) is 
an isomorphism. Now applying (8.4) with S = S’ and ¢’ = 7, we deduce 
that the composition 

s—+3—gmin 


is a morphism. Hence the same is true of r = ¢7! 0 (07). im 
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Let +: € = C bea minimal elliptic surface, and let E/K be the associated 
elliptic curve over the function field K = k(C) of C (3.8). Recall (3.10c) 
that each point P € E(K) corresponds to a section op : C — €. We define 
a translation-by-P map 

TprE—€E 


on € by using the translation-by-o p(t) map on each non-singular fiber €,. 
It is clear that Tp is a birational map, since it is certainly given by rational 
functions and it has the rational inverse r_p. The minimality of € then 
implies that rp extends to a morphism. We record this important fact in 
the following proposition. 


Proposition 9.1. Let 7: € — C be a minimal elliptic surface with 
associated elliptic curve E/K. 
(a) For any point P € E(K), the translation-by-P map 


TpiE—E 


extends to an automorphism of €. 
(b) Let 


Aut(€/C) = {automorphisms 7 : € — € satisfying 70 T =}. 


Then the map 
E(k) — Aut(E/C), P+-— Tp, 


is a homomorphism. 


PROOF. (a) This a special case of Corollary 8.4.1, which says that any bi- 
rational map of a minimal fibered surface to itself extends to a morphism. 

(b) If the fiber €; is non-singular, then tp maps €; to itself by definition. 
It follows that 70 7Tp = 7 on all non-singular fibers. But the non-singular 
fibers are Zariski dense in €, and a morphism is determined by its values 
on any Zariski dense set (Hartshorne [1, [.4.1]), so 70 Tp = 7 on all of E. 
This proves that tp € Aut(€/C). Similarly, the identity tp4gq = Tp TQ is 
clearly true on all non-singular fibers, so it is true everywhere. Finally, to 
is the identity map, which completes the proof that E(K) — Aut(€/C) is 
a homomorphism. oO 


Let P € E(k) with corresponding section gap : C — €. The im- 


age op(C) of op is a curve on the surface €, which we can think of as a 
divisor on €. We will write 


(P) € Div(€) 
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for this divisor. It is important to note that the divisors 


(P)+(Q) and (P+ Q) 


are very different. The former is the sum of the two divisors (P) and (Q) 
in Div(€), whereas the latter is the image of the section op,g = op +9Q 
which is defined using the group law on E. The following proposition shows 
how they are related. 


Proposition 9.2. With notation as above, let P,,...,P, € E(K) be 
points, and let n1,---,n, € Z be integers such that 

[nyJPy +--+ + [n-|P, =O. 
Letn=n,+---+n,. Then the divisor 


n(P\)+---+n,(P,) — n(O) € Div(€) 


is linearly equivalent to a fibral divisor. 
In particular, for all P,Q € E(K), the divisor 


(P + Q) — (P) — (Q) + (0) 
is linearly equivalent to a fibral divisor. 


Proor. If P € E(K), our notation (P) is potentially ambiguous, since we 

could mean either the divisor on the curve E/K consisting of the point P, 

or the divisor on the surface € consisting of the curve ap(C). To resolve 

this difficulty, we will denote the former by (P) and the latter by (P)e. 
Fix a Weierstrass equation for F'/K, say 


E:y=22+Arc+B, A,Be kK, 
and consider the divisor 
D=n (Piet::-+7,(P,)e — 2(O)g € Div(E). 


By assumption, D has degree 0 and sums to the zero element of E(K). 
Applying [AEC, III.3.5] to the divisor D on the elliptic curve E/K, we find 
that D is linearly equivalent to 0. Thus there is a function f € K(£) such 
that 

D = (div f)z € Div(£). 


The relationship (3.8) between FE and € says that K(E) = k(€), so 
we can consider f as an algebraic function on the surface €. When we 
compute its divisor on €, we find that 


(div fe and ny(Pi)e +--+ (Pe — n(O)e 
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are almost the same. To see this, note that f € K(E) = k(C)(x,y) isa 
rational function in x and y with coefficients in k(C). Hence for all but 
finitely many t € C’, we can evaluate those coefficients to get a function f; € 
k(€,) whose divisor will be precisely 


ni(op,(t)) +--+ +nr(op,(t)) — n(ao(t)) € Div(E,). 
This proves that the difference 
(div f)e — (mi (Pie +--+ +n(P-)e —n(O)e) 
is contained in finitely many fibers, hence it is fibral. oO 
For any point P € E(k), the divisor (P) — (O) € Div(€) satisfies 
((P)-—(O))-n*(t)=0 — forallt EC. 


This is true because the image of a section will intersect a fiber 1*(t) 
exactly once. (See exercise 3.22.) This shows that we can apply (8.3) to 
the divisor (P) — (O), as in the following definition. 
Definition. For each point P € E(K), let Bp € Div(E) ® Q be a fibral 
divisor so that the divisor 
def 
Dp & (P)—(O)+ 6p 


satisfies 
Dp-F=0 for all fibral divisors F’ € Div(€). 


Such a divisor exists by (8.3) and the remarks made above. Then we define 
a pairing on E(k) by the formula 


(-,-): E(K) x E(k) —Q 
(P,Q) = —Dp- Dg. 


The next result shows that this geometrically defined pairing is equal 
to the canonical height pairing (4.3), which justifies our use of the same 
notation for the two pairings! This geometric construction of the canonical 
height is due to Manin [1]. See also Shioda [2] for a more detailed analysis 
of the induced Euclidean structure on the lattice E(K)/E(K )tors- 


Theorem 9.3. (Manin [1]) Let 7: € + C be a minimal elliptic surface 
with associated elliptic curve E/K. The pairing 


(-,:): E(K) x E(k) — Q, (P,Q) =—Dp- Dg, 


defined above has the following two properties: 
(a) (-, +) is bilinear. 
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(b) (P,P) = h(P) + O(1) for all P € E(K), where we recall from §4 
that h(P) = h(xp) is the degree of the map zp: C — P!. 

Hence this pairing agrees with the canonical height pairing defined 
in (4.3). In particular, h(P) = 3(P, P) € Q for all P€ E(K). 


Proor. (a) Let P,Q, R € E(K) be any three points. Applying (9.2), we 
choose a fibral divisor F’ such that 


(Q+ R)—-(Q)-(R)+ (0) ~F. 
Then using standard properties of the intersection pairing (7.2), we com- 
pute 
=-Dp-Do1ir+Dp-Dgo+Dp-Dr 
=—Dp- ((Q+ 8) — (O) + ®a+Rr) + Dp: ((Q) — (O) + gq) 
+ Dp: ((R) - (0) +n) 
= —Dp-((Q+ R) — (Q) —(R) + (0) + Garr — Bq — Gp) 
=-Dp.- (F + Goir -— Po - @p) 
=0. 


The last line follows from the fact that Dp has trivial intersection with all 
fibral divisors. Hence 


(P,Q + R) = (P,Q) po (P, R) =0. 
It is also easy to check that the pairing is symmetric, 
(P,Q) = —Dp - Dg = —Da: Dp = (Q, P). 


This completes the proof that the pairing is bilinear. 
(b) Directly from the definition we find that 
(P,P) =-Dp-Dp 
= —((P) -(O)+®p)- Dp 
=—((P)-(O))-Dp_ since Dp - (fibral) = 0 
= 2(P) - (0) — (P)? ~ (0)? + ((P) — (O)) - Bp. 
Our first claim is that (P)? does not depend on P. To see this, consider 


the translation-by-P map 
Tp: E— €E. 


We know from (9.1a) that 7p extends to an automorphism of €. It follows 
that 75D, -7T$D2 = D, - D2 for any two divisors D,, Dz € Div(€). Hence 


(P)-(P) = tp(P) - tp(P) = (O) - (O) 
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is independent of rp. Alternative approach: first show that the canoni- 
cal divisor on € has the form 2*6, and then use the adjunction formula 
(Hartshorne [1, V.1.5]) to compute 


(P)? = 29(C) — 2 — (P) - 2*6 = 29(C) — 2 — degé. 


Our second observation is that although ®p depends on P, there are 
essentially only finitely many choices for ®p. More precisely, for each t € C, 
write 


Tt 
a (t) = Se Nel i 
i=0 


as a sum of irreducible components. Note that r; = O for all but finitely 
many t € C. Looking back at the proof of (8.3), we see that ®p can be 
written in the form 


Pp = Se date + m* (6) 


tEC i=1 


for some 6 € Div(C), where the integers a,; are uniquely determined by 
the finitely many intersection indices 


But every (P)-T;; is either 0 or 1, so as we take different points P € E(K), 
there will be only finitely many possibilities for the a;;’s. Further, 


((P) — (O)) -*(6) = (P) + 1*(6) — (O) - 1*(5) = deg(5) — deg(5) = 0 


from exercise 3.22(b), so we find that 


((P) - (0)) p= ((P)- (0) (Laan) 


t6C 451 


can take on only finitely many values as we vary P € E(K). 
Combining these two observations with the calculation from above 
yields 
(P, P) = 2(P)-(O) + O(1) for P € E(K). 


It remains to calculate the intersection index (P)-(O). Adjusting the O(1) 
if necessary, we may assume that [2)P # O. Fix a Weierstrass equation 
for E, 

E:y=2°4+ Axc+B, 


and write P = (xp,yp). Changing coordinates if necessary, we may as- 
sume xp and yp have no poles in common with the poles of A and B. 
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Let t € C. We will compute the local intersection index of (P) 
and (QO) at the point o0(t). We denote this local intersection by (P - O);. 
If ord;(xp) > 0, that is, if ep(t) A oo, then (P) and (O) do not intersect 
on the fiber €;, so (P-O), = 0. Suppose now that ord;(xp) < 0, so the 
equation for £ tells us that 


3ord;(ap) = 2ord:(yp). 


We make a change of coordinates w = x/y, z = 1/y, so E now has the 
equation 
E:z=w>+ Awz? + B23, 


and P = (wp,zp) = (ap/yp,1/yp). Also let u € k(C) C k(E) be a 
uniformizer at t, so we are looking at the intersection of (P) and (O) at 
the point (w,z,u) = (0,0,0). The local ring of € at this point is 


k[w, 2, ul(o,0,0) 
(z —w — Awz? — Bz3)’ 


Further, in this ring, (P) has the local equation w — wp = 0 and (O) has 
the local equation w = 0, so by definition the intersection index (P,O), is 
equal to the dimension over k of the vector space 


k[w, z,u](0,0,0) ~ kz, u](0,0) ~ k{ulo 
(z — w3 — Awz? — Bzi,w-—wp,w) (2 - Bz3,wp)” (wp) 


Note the last equality follows from the fact that z — Bz? = z(1 — Bz?) 
and 1 — Bz? is a unit in k[z, ule). If we write wp = u°w'p for some 
function wp that is neither 0 nor oo at t, then we have 


and also i 
e = ord; wp = ord;(xp/yp) = a5 ord,(ap). 


This proves that 


af if ord;(ap) > 0, 
(P-On={ 04 ordiler) ifordstee) <0 


Adding over t € C’ gives 


(P)(0)=S(P-On= YS 5 muse 5 dealer) = 5h(P). 
teC tEC, ordy(xp)<0 
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Hence 
(P,P) = 2(P)-(O) + O(1) = A(P) + O(1), 


which completes the proof of (b). 

Let g(P) = 4(P, P) be the quadratic form associated to our pairing. 
Then g(P) = $h(P) + O(1) from (b), whereas the bilinearity in (a) tells 
us that g(2P) = 4g(P). This shows that g satisfies properties (i) and (ii) 
of (4.3b), so by the uniqueness (4.3e) of the canonical height, we have g = h. 

Oo 


Theorem 9.3 shows that the canonical height on F(A’) can be com- 
puted using intersection theory. Our next goal is to define a natural pair- 
ing on a certain subgroup E(K)o of E(k). This pairing takes its values 
in Pic(C), and the composition , 


deg 


E(K)o x E(K)y — Pic(C) “3 Z 


will be the canonical height pairing. We begin by describing E(K)o. 

Let P € E(K) be a point and rp : € > € the translation-by-P auto- 
morphism. We know from (9.1b) that tp gives an automorphism of each 
fiber €,. In particular, it must permute each of the components of €,. 


Definition. Define a subset E(K)o of E(K) by 
E(K)o = {P € E(K) : tp(¥) =P for all fibral curves Tc €}. 


Lemma 9.4. E(K) is a subgroup of finite index in E(K). 


PRooF. Let P,Q € E(K)o. From (9.1b) we know that tp49 = Tp ° Ta, 
so for any fibral curve we have tpyQ(C) = tp(7Q(C)) = te(l) = T. 
Therefore P+ Q € E(K)o. Similarly, P = to(T) = 7_p4p(T) = 7_p(T), 
so —P € E(K) . This proves that E(K’)o is a subgroup of E(K). 

For the second part, we observe that if €; is an irreducible fiber, then 
clearly rp(E,) = E,. Let {T1,...,[0,} be the set of all components of the 
reducible fibers of €. It is a finite set, since € has finitely many reducible 
fibers, and each reducible fiber has finitely many components. Then E(k) 
acts on this set by 


BY == AGT Pe 8s, 
BP Shem (Ti > tp(Ti)). 


In other words, there is a homomorphism from E(K) into the symmet- 
ric group $, on r letters. From the definition of E(K)o, the quotient 
group E(K)/E(K)o injects into 8,, which proves that E(K)o is a sub- 
group of finite index. 

O 
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Remark 9.4.1. There is another way to characterize F(K)o in terms of 
the sections op : C — € associated to points P € E(K). Let 09: C > € 
be the zero-section. Then a point P € E(K) is in E(K)po if and only if 
the curves op(C) and o9(C) hit the same component of every fiber of €. 
We will study the group E(K) and the quotient E(K)/E(K)po in greater 
generality and detail in the next chapter; see (IV.6.12), (IV.9.1), (IV.9.2) 
and exercise 4.25. 


For any two points P,Q € E(K), Proposition 9.2 tells us that there is 
a fibral divisor ®p.g € Div(€) satisfying 


(P+ Q) — (P) — (Q) + (O) ~ ®paq. 


Clearly, ®pg is determined by P and Q up to principal divisors, so its 
class in Pic(€) is well-defined. This gives a pairing on E(K) with val- 
ues in Pic(€). The next result shows that this pairing is quite nice when 
restricted to the subgroup E(K)o. 


Theorem 9.5. Let a:&€—C bea minimal elliptic surface with associ- 
ated elliptic curve E/K. 
(a) Let P,Q € E(K)o. Then there exists a divisor [P,Q] € Div(C) such 
that 
(P+Q) -(P)-(Q)+ (0) ~ m*([P,Q]). 

The divisor [P,Q] is determined by P and Q up to linear equivalence. 
(b) The pairing 

E(K)o x E(K)o — Pic(C), (P,Q) += class[P, Q], 


is a well-defined symmetric bilinear pairing. (See also exercise 3.26.) 


(c) 
(P,Q) = deg[P,Q] for all P,Q € E(K)o. 


In particular, h(P) = 4 deg[P, P] for all P € E(K)o. 


PROOF. (a) For any two points P,Q € E(K), let pg € Div(E) be a fibral 
divisor satisfying 


(P+ Q) — (P) —(Q)+(O) ~ pq 
as described in (9.2). Then for any fibral divisor F € Div(€), 
Deg hae tO) PP) =(QeF+lOp ek 
=19(P)-F-(P):-F-—19(O)-F+(O):F 
= T9((P) — (O))-F — ((P) - (O)) -F 


SCP) (0) 9a) (PO) F 
=0 since Q € E(K)o implies T_9(F) = F. 
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But ®pq itself is fibral, so we deduce that £39 = 0. It follows from (8.2b) 
that there is a divisor [P,Q] € Div(C) ® Q such that ®pg = n*([P,Q]). 
This will suffice for our purposes in this chapter, so we will leave it for the 
reader (exercise 3.28c) to show that [P,Q] is actually in Div(C). 

The divisor ®p g is clearly determined by P and Q up to linear equiv- 
alence on €. In order to show that [P,Q] is determined up to linear equiv- 
alence on C, we will prove that if 6 € Div(C) satisfies 7*6 ~ 0, then 6 ~ 0. 
Write 7*6 = div(f) for some f € k(€). For all but finitely many t € C 
we can restrict f to the fiber €; to get a rational function f; € k(€4). By 
assumption, the poles and zero of f lie on finitely many fibers, so for almost 
all t € C we see that f; € k(€;) has no zeros or poles. It follows that f; 
is constant. Let a : C — E be any section, for example the zero section. 
Then the fact that f is constant on almost all fibers means that the func- 
tion f — f 00 07 is identically 0 on those fibers. But a rational function 
is determined by its values on any non-empty open set, so f = fogorn. 
Therefore 


n*6 = div(f) = div(foaon) =n*(div(foo)), 


so 6 = div(f 0@) is a principal divisor on C. Notice that what this result 
really says is that the natural map 7* : Pic(C’) — Pic(€) is injective. 

(b) The pairing is well-defined from (a), and it is clearly symmetric. To 
see that it is bilinear, we let P,Q, R € E(K)o and compute 


Spain ~(P+Q+R)-—(P)-(Q+R)+(O) 
=(P+Q+R)-(P+R)-(Q+R)4+(R) 
qeP ep GP) te) (O) 

= tr((P + Q) — (P) — (Q) + (O)) + ((P + R) — (P) - (R) + (O)) 

~TR(®p.q)t+ Oper 

= Opa + Opp. 
Note that the last equality is true because R € E(K’)o, so TR fixes the fibral 
divisor ®p.g. Now write each ®x y as 7* ([X ,Y]) and use the fact proven 


above that 7* : Pic(C) — Pic(€) is injective. This yields the desired result, 


[P,Q+ R] ~ [P,Q] +[P, RI. 


(c) Let P € E(K)o. Then for every fibral divisor F € Div(€) we have 
(P)- F = 1p(0)- F = (0) -t-p(F) = (0): F. 
In other words, 


((P) — (O)).F =0 for all fibral divisors F € Div(€), 
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so in the notation of (9.2), Dp = (P) — (O). Note that this is only valid 
for points in E(K)o. Let P,Q € E(K)o. We compute 
—(P,Q) =Dp-Dg by definition of (-, -) 
= ((P) — (O)) - ((Q) — (O)) from above 
= (P)-(Q) —(P)-(O eo 
= T_p(P) -7-p(Q) — t-p(P) -r-p(O) — (Q) -(O) + (0) - (0) 
=(O)} (=P +Q)=(0) “(-P) = (Q) (0) +0) (@) 
(HP +Q) = (=P) =(Q) (0) 40) 
=n" ([-P,Q]) -(O) by definition of [-, -] in (a) 
= deg|—P, Q] from exercise 3.22(b) 
= — deg|P, Q] by linearity of [-, -] from (b). 
This proves that (P,Q) = deg[P, Q] for all P,Q € E(K)o. Putting P = Q 


gives h(P) = 5 (P, P)= 4 deg[P, P], which completes the proof of Theo- 
rem 9.5. 


oO 


Remark 9.6. Theorem 9.5 says that the canonical height pairing (-, -) en- 
dows E(K)o with the structure of a Euclidean lattice whose inner product 
takes integer values. Similarly, the height pairing gives E(K) a Euclidean 
structure with an inner product taking rational values having severely lim- 
ited denominators. It is an interesting problem to classify the possible lat- 
tice structures on E(K’)p and E(K). In a series of papers, T. Shioda [1,2,4— 
7] has investigated these Mordell-Weil lattices and proven many interesting 
results, including the construction of examples for which E(K’)9 is isomor- 
phic to a root lattice of type E.6, E7, and Eg. 


Remark 9.7. Let E€ — C be a non-split minimal elliptic surface, and 
let E/K& be the associated elliptic curve. The Néron-Severi group of €, de- 
noted by NS(€), is the group of divisors modulo algebraic equivalence. (For 
the definition of algebraic equivalence, see Hartshorne [1, exercise V.1.7].) 
One can prove that NS(€) is a finitely generated group and that the inter- 
section pairing on Div(€) gives a well-defined pairing on NS(€). It is thus 
an interesting question to relate NS(€) and its intersection pairing to E(K) 
and its height pairing. Shioda [3, Thm. 1.1] has shown how to find genera- 
tors for NS(€) by using generators for F(A’) and fibral components of €. In 
particular, he proves the fundamental rank relation (Shioda [3, Cor. 1.5]) 


rank NS(€) = rank E(K) + 2+ Sor - 1), 
teC 


where r; is the number of irreducible components in the fiber €;. He also 
gives a formula relating the intersection regulator of NS(€) to the canonical 
height regulator of E(K). 
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Let € — C be an elliptic surface defined over a number field k. For each 
point t € C(k) such that €; is non-singular, there is a canonical height 
function h : €:(k) — R on the elliptic curve €;. In the next section we 
will investigate how the canonical height varies from fiber to fiber, espe- 
cially for points lying on the image of a section 0 : C — €. To carry out 
this investigation, we will need to develop more fully the theory of height 
functions on varieties. 

For our purposes in this chapter, it would suffice to consider only 
curves and surfaces, but the theory is hardly more difficult for general 
varieties. We will, however, need to assume that the reader is familiar 
with standard properties of divisors on varieties, as covered for example 
in Hartshorne [1, II §§6,7]. Some of the proofs in this section are fairly 
technical, so some readers may want to read the definitions and statements 
of the main results (10.1, 10.2, 10.3) and then proceed directly to the next 
section. 

For this section, we set the following notation: 


k a number field, with algebraic closure k, 
V/k  anon-singular projective variety defined over k, 
Div(V) the group of divisors on V, 


hp the (absolute logarithmic) height function hp : P"(k) > R on 
projective space as defined in [AEC, VIII §5]. 


A morphism ¢ : V — W between non-singular varieties induces a 
homomorphism of their divisor groups ¢* : Div(W) — Div(V) in the fol- 
lowing way. Let  € Div(W) be an irreducible divisor and fix a function up 
which vanishes to order 1 along [’. Equivalently, up is a generator for the 
maximal ideal in the discrete valuation ring Oy.r. Then 


T= S° orda(uro@)A, 


AEDiv(V) 


where the sum is over all irreducible divisors A € Div(V) and we are writing 
orda : k(V)* — Z for the normalized valuation on the local ring Oya. Of 
course, we have cheated a little bit. The divisor ¢*I will only be defined if 
the image $(V) is not contained in I, since otherwise ur o ¢ is identically 
zero. However, ¢* sends principal divisors to principal divisors, so it induces 
a map ¢* : Pic(W) — Pic(V) which is well-defined on all of Pic(W), since 
we can always move I by a linear equivalence so that it intersects ¢(V) 
properly. 
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The following theorem is often called the “Height Machine.” It is the 
main result of this section. The Height Machine associates a height function 
to each divisor on V, or, more precisely, it associates an equivalence class 
of height functions to each divisor class on V. The power of the height 
machine is that it takes geometric relations involving divisor classes on V 
and translates them into height relations between points on V. It is thus 
a tool for transforming geometric information into arithmetic information. 
We have already seen this machine in action in [AEC, VIII §6], where 
the geometric group law on an elliptic curve was transformed into the 
arithmetic statement h(P + Q) + h(P — Q) = 2h(P) + 2h(Q) + O(1). 
The general formulation of the Height Machine is due to André Weil. For 
further details and additional properties of heights, see Lang [4], Hindry- 
Silverman [1], and exercises 3.31 and 3.32. 


Theorem 10.1.  (Weil’s Height Machine, Weil [2]) Let V be a non- 
singular projective variety defined over a number field k. There is a map 


h: Div(V) — {functions V(k) — R}, 


uniquely determined up to bounded functions on V(k), with the following 
two properties: 

(a) (Normalization) Let ¢: V — P” be a morphism, and let H € Div(P”) 
be a hyperplane with the property that ¢(V) ¢ H. Then 


hg-u(P) = hp(o(P)) #O(1) for all PE V(). 


(b) (Additivity) Let D, D’ € Div(V). Then 
hpip(P) =hp(P)+hp(P)+ O(1) for all PE V(k). 


The height mapping has the following additional properties: 
(c) (Equivalence) Let D, D’ € Div(V) be linearly equivalent divisors. Then 


hp(P) =hp(P)+O(1) for all PE V(k). 
(d) (Functoriality) Let »:V — W be a morphism of non-singular projec- 
tive varieties over k, and let D € Div(W). Then 


hy p(P) = hw.o(v(P)) +O) for all P € V(k). 


Remark 10.1.1. Another way to formulate Theorem 10.1 is as follows. 
For every V/Q there is a unique homomorphism 

{functions V(Q) — R} 
{bounded functions V(Q) — R} 


such that hpr is the usual height on projective space [AEC, VIII §5] and 
such that hy.y-p = hw,p ow for every morphism ~:V — W. 


hy : Pic(V) — 
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Example 10.1.2. Let E’/k be an elliptic curve. Consider the divisor re- 
lation [2]*(O) ~ 4(O), which follows from [AEC, III.3.5] and the fact that 
the four points in E[2] sum to zero. Then (10.1) gives the height relation 


Avo) ([2]P) = hpg+(0)(P) + O(1) = havo) + O(1) = 4hioy (P) + O(1). 


This is one of the properties of the height that was used in the proof of the 
Mordell-Weil theorem [AEC, VIII.6.7]. 


Example 10.1.3. Two divisors D, D’ € Div(P") on the projective line are 
linearly equivalent if and only if they have the same degree. It follows from 
the additivity and equivalence properties (10.1b,c) that 


deg(D')hp(P) = deg(D)hp (P)+O(1) for all P € P1(k). 
In particular, (10.1a) implies that hp(P) = deg(D)hp(P) + O(1). 


For curves of higher genus, the identity (10.1.3) will not be true, since 
divisors of the same degree need not be linearly equivalent. However, a 
slightly weaker result is valid, as described in the following result. For a 
generalization to varieties of arbitrary dimension, see exercise 3.32. 


Theorem 10.2. Let C be a curve, let D,D’ € Div(C) be divisors 
with deg(D) # 0, and let hp, hp’ be associated height functions. Then 


hp(P) _ deg(D') 


if a ey 


PEC(k),hp(P)00 hp(P) — deg(D) " 


The last theorem that we will be proving in this section is a finiteness 
result. Recall [AEC, VIJJI.5.11] that in projective space P” (k), there are only 
finitely many points of bounded height. Of course, here height means hp, 
the standard height on projective space. It is clear that this result cannot 
be true for every height hp on every variety. For example, if V(x) is infinite, 
then it cannot be true for both hp and h_p, since h_p = —hp + O(1). 
In order to give the correct statement, we need one definition. 


Definition. A divisor D € Div(V) is called very ample if there is an 
embedding ¢: V — P” and a hyperplane H € Div(P”) not containing ¢(V) 
so that D = ¢*H. (To say that 6: V > P” is an embedding means that ¢ 
maps V isomorphically onto its image.) The divisor D is called ample if 
there is an integer n > 0 so that nD is very ample. 


Theorem 10.3. Let D € Div(V) be an ample divisor on V, and let hp : 
V(k) — R be an associated height function. Then for all a,b > 0, the set 
{PeEV(k):hp(P)<a and [k(P):k] <b} 
is finite. In particular, the set {P € V(k’) : A(P) < a} is finite for any 

finite extension k'/k. 
Example 10.3.1. Let C be a non-singular curve of genus g, and let D € 
Div(C) be a divisor. Then D is ample if deg(D) > 0, and D is very 


ample if deg(D) > 2g +1. See Hartshorne [1, IV.3.2, IV.3.3] or [AEC, 
exercise ITT.3.6]. 
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Example 10.3.2. Let S be a non-singular surface. The criterion of Nakai- 
Moishezon (Hartshorne {1, V.1.10]) says that a divisor D € Div(S) is ample 
if and only if D? > 0 and D-T > 0 for all irreducible curves I c S. 


As mentioned above, the reader may at this point wish to proceed 
directly to the next section, where we will apply the Height Machine to 
study the specialization map on elliptic surfaces. The remainder of this 
section is devoted to proving Theorems 10.1—10.3. 


Definition. Let ¢: V — P” be a morphism of V into projective space. A 
divisor D € Div(V) is said to be associated to ¢ if there is a hyperplane H € 
Div(P"), not containing ¢(V), such that D = ¢*H. Note that the divisor 
class of D is uniquely determined by ¢, since any two hyperplanes in P* 
are linearly equivalent. 

The height on V associated to ¢ is the height function 


hg : V(k) —_ R, hg(P) = hp(¢(P)). 


Our ultimate goal is to associate to every divisor D on V a height 
function hp with the properties described in (10.1). In particular, we will 
want the heights attached to linearly equivalent divisors to be essentially 
the same. The following important proposition will be crucial for this 
construction. 


Lemma 10.4. Let 6: V ~F andy: V — FP* be morphisms which are 
associated to the same divisor class. Then 


hg(P) = hy(P) + O(1) for all P € V(k). 
Here the O(1) depends on ¢ and yw but is independent of P. 


Proor. Let D be any positive divisor in the divisor class associated to ¢ 
and w. This means that on the complement of D we can write ¢ and y in the 
form ¢ = [fo,..-,f,] and # = [go,..., 9s] with rational functions fi,g; € 
k(V) satisfying 


div(f;) = D;-—D and div(g;) = D,— D_ for divisors D;, Di, > 0. 


Further, the fact that ¢ is a morphism means that the D,’s have no points in 
common, and similarly for the D; ’s. (For general facts about the relation- 
ship between morphisms ¢ : V — P" and divisors, see Hartshorne [1, IT §7], 
especially the section on linear systems.) 

Now fix some j, let Vj = V \ Dj be the complement of Dj, and 
let F; = fi/g; forO <i<r. Notice that 


div(F,) = div(fi/a) = (Di — D) — (D5 — D) = Dy - DI, 
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so F; is a regular function on V;. Taken together, the F;,’s define a morphism 
F =(Fo,...,F,):Vj — At. 


We also observe that the F;’s have no common zeros on Vj, since any 
common zero would lie on all of the D,’s. 

We need to recall how the maximal ideals in the ring ® = k[Fo,..., Fy] 
correspond to the points of V;. If 2 C K is a maximal ideal, then R/IM 
is a finitely generated k-algebra which is also a field. It follows from 
the weak Nullstellensatz (Atiyah-MacDonald [1, 5.24, 7.10], Lang [7, X §2 
Cor. 2.2]) that 9/9 is isomorphic to k. More precisely, the natural in- 
clusion k — /9N is an isomorphism. This means that there are unique 
elements ag,...,a, € k so that F; = a; (mod 9), and then there is a 
unique point Po € V;(k) with F(Pan) = (a0,.--,a,r). Equivalently, the 
point Psy is determined by the congruences 


F, = F;(Pm) (mod 9) forO <i<r. 


Now consider the ideal J = (Fo,...,F,) C R generated by the F;’s. 
We claim that J must be the unit ideal. To prove this, we assume that 3 
is not the unit ideal and derive a contradiction. Every non-unit ideal is 
contained in at least one maximal ideal, so we take a maximal ideal IM 
with 3 C M. Then M corresponds to a point Psy € Vj as described above. 
On the other hand, we have F; € 3 C MM from the definition of 3, so 


F,(Pm) = Fi =0 (mod 9M). 


Hence F;(Pyx) € KAM, so F;(Pon) = 0. In other words, Psy is a common 
zero of Fo,..., F;, which is a contradiction. This completes the proof that J 
is the unit ideal. 

We can rephrase this last argument in slightly fancier language. The 
scheme Spec() is isomorphic to V;, and by the weak Nullstellentsatz, 
maximal ideals in Spec(®) correspond to k-valued points in V;. But then 
any maximal ideal SN containing 3 would correspond to a point P in the 
zero set of 3, contradicting the fact that the F;’s have no common zero. 

The fact that J = (fo,...,F,) is the unit ideal in R means that we 
can find a polynomial A;(Tp,...,T;) € k[To,.--,T;] with no constant term 
such that 

1=A;(Fo,..-.,F,). 


For any finite extension k’/k, any point P € V;(k’), and any absolute 
value v on k’, we evaluate this identity at P, take the v-adic absolute 
value, and use the triangle inequality to get an estimate of the form 


1 <c; max{|Fo(P)|v,.--,|Fr(P)|o}- 
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Here c; = c1(v,¢,~, D,A;) > 0 is a constant that does not depend on P. 
Further, for all but finitely many absolute values on k’, we can take c; = 1. 
(See the proof of [AEC, VIII.5.6] for a similar calculation.) 

Recall that F; = f;/g;, so if we multiply both sides by |g;(P)|, we 
obtain the estimate 


lgj(P)lv < er max{|fo(P)lv,---s|fr-(P)lo}- 


Notice that this bound is still valid if g;(P) = 0, so it holds for all points at 
which the f;’s and g;’s are defined, that is, at all points on the complement 
of D. Taking the maximum for 0 < 7 < s gives 


max{|go(P)|v,---,lgs(P)lv } < co max{|fo(P)lv,---|fr(P) lv}, 


where co = co(v,¢,w,D) is again positive and is equal to 1 for all but 
finitely many v. Now we take the logarithm of both sides, multiply by the 
local degrees [k/, : Q,]/[k’ : Q], and sum over all absolute values on k’ to 
obtain 


h([go(P),---,9s(P)]) < a [fe P pe PE] + €3, 


where c3 = c3(¢, W, D) is independent of P. In other words, we have shown 
that 


h(u(P)) < A((G(P)) +3 for all P € (V\ D)(k). 


The divisor D was chosen to be any positive divisor in the divisor 
class associated to @ and yw. In other words, we can take D to be ¢*H 
for any hyperplane H Cc P" not containing ¢(V). Let My,...,Hm CP” be 
hyperplanes not containing ¢(V) with the property that H)N---N Hm = 
. Then the corresponding divisors $*H,,...,¢*H have no points in 
common, so their complements cover V. Applying the above estimate to 
each of these D’s and letting c4 be the maximum of the c3’s gives 


A(d(P)) <A((o(P)) +e, for all PE V(R). 


This is one of the inequalities we are trying to prove, and the opposite 
inequality follows if we interchange the role of ¢ and w. O 


If D € Div(V) is a very ample divisor, then we can choose an em- 
bedding @ : V — FP” associated to D and attach to D the height func- 
tion hp = hg. For arbitrary divisors D, we will write D = D, — Dz asa 
difference of very ample divisors and define hp linearly by hp = hp, —hp,. 
The following lemma shows that every divisor can be decomposed in this 
way. 
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Lemma 10.5. Every divisor on V can be written as a difference of two 
very ample divisors. 


PROOF. This is a basic result from algebraic geometry. Before giving the 
general proof, we consider a special case. Suppose V is a curve of genus g. 
Then a divisor on V is very ample if it has degree at least 2g + 1 (10.3.1). 
So for an arbitrary divisor D € Div(V), say of degree d, we can write D as 
a difference of very ample divisors 


D =(D+n(P)) —n(P) 


by choosing n = 2g + 1+ |d|. Similarly, let V be a surface, D € Div(V) 
an arbitrary divisor, and H € Div(V) an ample divisor. Then one can use 
the Nakai-Moishezon criterion (10.2.2) to show that nH + D is ample for 
all sufficiently large n, after which it is easy to write D as a difference of 
very ample divisors. We will leave the details to the reader (exercise 3.30) 
and go on to the general case. 

Let D € Div(V) be an arbitrary divisor, and fix a very ample divi- 
sor H € Div(V). Serre’s theorem (Hartshorne [1, II.5.17, IL.7.4.3]) says 
that there is an integer n > 1 so that D+nH is ample. (Note we have 
translated from the language of invertible sheaves into the language of divi- 
sors, as explained in the last part of Hartshorne [1, II §6].) It follows from 
Hartshorne [1, II.7.6] that m(D+n#) is very ample for all sufficiently large 
integers m. Further, nH is very ample, so nH +m(D+nH) is very ample, 
since it is the sum of two very amples. Hence 


D=(m+1)(D+nH) — (nH +m(D +n#H)) 
is a difference of very ample divisors. oO 


Lemma 10.5 lets us decompose a divisor D into a difference D, — D2 of 
very ample divisors. In particular, D; and D2 are associated to morphisms 
from V into projective space. The next result gives some basic properties 
of height functions associated to such morphisms. 


Lemma 10.6. Let ¢, : V — P” and ¢2 : V — P® be morphisms, and 
let D, and Dz be divisors associated to ¢, and ¢2 respectively. 

(a) There exists a morphism $3 : V — P™*+"*® associated to D, + Do. 
(b) If@:V — P” is any morphism associated to D, + D2, then 


he(P) =he,(P)+he(P)+O(1) for all PE V(k). 
Proor. (a) The Segre embedding (Hartshorne [1, exercise 1.2.14] or Har- 
ris [1, 2.11-2.29]) is the map 


P" x Ps os prstrts 
(oierytrly Wonostel) Ks leone se Sar ee teas: 
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It is clear from this definition that if we pull a hyperplane back by the Segre 
embedding, we will get H x P* + P” x H, where the H’s are hyperplanes 
in the appropriate projective spaces. Hence D,; + Dg is associated to the 
morphism 


V diagonal Vey oD, XPD2 P" x Ps Segre prstrts. 


(b) Write 
$1 = [fo,---, fr): V —~P” and $2 = [go,---,9s]: V > P* 


with rational functions fo,...,gs € k(V). Lemma 10.4 says that it suffices 
to prove (b) for any one morphism ¢ associated to D, + D2, so we will 
take ¢ to be the map using the Segre embedding described in (a). In other 
words, 


= [fogo, SS figj: eee fr9s| /V 3 prstrts. 


Let P € V(k) be any point. Replacing k be a finite extension, we may 
assume that P € V(k). Then directly from the definition of the height on 
projective space we have 


he(P) = h(¢(P)) 
Rogol Piss eefiGj (Pas trol P))} 


= eo. 


veEeM, 


Q 
k:Q 

ae ia a area log ails max |gy(P “ 
Q 
Q 


Sty MSS 


O<j<s 
vEM, 


Sees 


Soa (ios (gaax \n(P)|) ibe (uw max [95 ( Pyl)) 


= h([fo(P),---,fr(P)]) + h([go(P),---.9s(P)])- 


After these lengthy preliminaries, we are finally ready to tackle the 
proof of the Height Machine. 


PRooF (of the Height Machine (10.1)). Take each divisor D € Div(V) 
and write it as a difference D = D, — D2 of divisors with the property 
that there are morphisms ¢; : V — P” and ¢2: V — P® associated to D; 
and Dy respectively. Note that (10.5) assures us that this is possible; in 
fact, (10.5) says that we can even choose D; and D2 so that ¢; and ¢2 are 
embeddings. In any case, having fixed D,, Do, 61, ¢2, we define hp to be 


hp(P) =hg,(P) —hg,(P) for all P € V(R). 
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Our first observation is that if the Height Machine exists, then up to a 
bounded function, this is the only choice for hp. This follows from (10.1a) 
and (10.1b), which let us compute 


hp= hD,-D2 a hp, = AD» + O(1) a he,(P) = hé.(P) + O(1). 


This gives the uniqueness assertion in Theorem 10.1. 

Next we show that up to bounded functions, hp is independent of the 
choice of D,, Dz, ¢1,¢2. Once we have proven this independence, the rest 
of (10.1) will follow very easily. So suppose that D = Dj — Dj is another 
decomposition, and let ¢ :V — P” and ¢4:V —> P*’ be morphisms asso- 
ciated to Di, and Dé, respectively. Then Dj + Dz = D; + Dj. Lemma 10.6 
says that there exists a morphism ¢ : V — P” associated to this divisor, 
and then two applications of (10.6b) yields 


hg + hg, _ hg + O(1) = hg, + hey + O(1). 


Therefore 
hg, (P) = hg, (P) = hes (Py hg: (P) + O(1), 


which proves that up to bounded functions, the definition of hp is inde- 
pendent of the choice of D,, Do, d1, de. 

(a) The divisor D = $*H is already associated to a morphism, so we can 
write D as D = D —0. Note that the divisor 0 is associated to the trivial 
morphism yw : V > P° which maps V to a point. Then 


hp = hg — hy + O(1) = hg + O(1), 


since hy(P) = hp(w(P)) is a constant. 

(b) We decompose each of the given divisors into a difference of divisors 
that are associated to morphisms, say D = D, — Dz and D’ = Di — D5. 
Then D, + Dj and D, + Dj are also associated to morphisms (10.6a), and 
their difference is D + D’, so we can use (a) and (10.6b) to compute 


Ap+p' = hp,+p; — hp, +p, + O(1) = hp, + hp, — hp, — hp, + O(1) 
=hp+hp +O(1). 
(c) Write D — D’ as a difference of divisors associated to morphisms, 
say D— D! = D, — Do, with D, associated to ¢, and Dg associated to ¢2. 


Note that D, and D2 are linearly equivalent by assumption, so (10.4) tells 
us that hg, = hg, + O(1). Now using (b) we obtain the desired result, 


hp—-hp =hp_p: + O(1) = hg, = hd» + O(1) = O(1). 


264 III. Elliptic Surfaces 


(d) By the linearity proven in (b), it suffices to prove (d) for a divisor D 
associated to a morphism ¢@: W — P”. Then the divisor ~* D is associated 
to the morphism gow: V — W — P’. Using (a) twice, we find 


hyd = hv. gow + O(1) = hw.e owt O(1) = hw.p owt O(1). 


Note that the middle equality is trivial, since hgoy = hpodow by definition. 
O 


PROOF (of Theorem 10.2). Let d = deg(D) and d’ = deg(D’). Replacing D 
by —D if necessary, we may assume that d > 1. For any integer n we 
consider the divisor 


A, = (29 +1)D+n(d'D — dD’). 


Notice that deg(H,,) = d(2g + 1), so (10.3.1) says that H,, is a very ample 
divisor on C’. In particular, there is an embedding ¢, : C — P” associated 
to H,,, so (10.1a) gives 


hu, (P) = he(dn(P)) +O(1) for all P € C(R). 


Now using the definition of H,, the linearity property of height func- 
tions (10.1b), and the fact that the height on projective space is non- 
negative [AEC, VIII.5.4b], we obtain the estimate 


(2g + l)hp(P) + nd'hp(P) -_ ndhp(P) = hu, (P) + O(1) 2 —Chn.- 


Note that the constant c, depends on n, but it is independent of P. As- 
suming hp(P) > 0, a little algebra then gives the inequality 


d’ hp: (P) 2g+ 1 Cn 
“(G betPY) =~ G  dhp(P) 


Taking the liminf as hp(P) — oo, we obtain 


d' ol?) dgd 
lim inf = ; 
PEC(R). hp (P)0 ( ad hplP) yf > d 


This is true for every value of n (positive and negative), which gives the 
desired result, 


: (4 bot?) 
_ lim — 
PEC(k),hp(P)>00 \d — hp(P) = 


ProoF (of Theorem 10.3). Replacing D by nD and using the fact (10.1b) 
that hnp = nhp + O(1), we may assume that D is very ample. Let ¢: 
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V — P” be an embedding and H € Div(P”) a hyperplane with ¢*H = D. 
Taking a finite extension of k if necessary, we may assume that ¢ is defined 
over k. Then (10.1a) implies that there is a constant c so that 


hp(P) =hg-n(P) > he(o(P))—¢ for all Pe V(k). 
It follows that ¢ maps the set 
{PeV(k):hp(P) <a and [k(P):k] <b} 
injectively into the set 
{QeP'(k) : he(Q)<a+c and [k(Q):k] <b}. 


This last set is finite from [AEC, VIII.5.11], which proves the first part 
of (10.3). The second part follows by setting 6 = 1. Oo 
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In this section we will prove a theorem of Tate which describes how the 
canonical height h(o p(t)) varies as one moves along a section of an elliptic 
surface. As a corollary we obtain a theorem of Silverman, strengthening 
earlier results of Néron, Dem’janenko, and Manin, which says that the 
specialization homomorphism E(k) — €;(k) is injective for all but finitely 
many t € C(k). 

Let 7: € — C bea minimal elliptic surface with corresponding elliptic 
curve F'/K, and let P € E(K). To ease notation, we will write 


P, = op(t) 


for the image of a point t € C by the section op : C — E associated to P. 


Theorem 11.1. (Tate [4]) Assume that the elliptic surface E — C is 
defined over a number field k. For each t € C(k) such that the fiber €, is 
non-singular, let 

(-, +): Ex(k) x Ex(k) —>R 


be the canonical height pairing on the elliptic curve €; [AEC, VIII §9]. 

Fix two points P,Q € E(K)o, let [P,Q] € Div(C) be the divisor 
described in (9.5), and let hip.gj : C(k) — R be an associated height 
function on C' (10.1). Then 


(Pr, Qt) = hyp.qy(t) +O(1) for allt ¢ C(k) such that €, is non-singular. 
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Note that the O(1) bound depends on P and Q but is independent of t. 


Remark 11.1.1. Putting P = Q in (11.1) gives he,(P,) = Ayp,p|(t)+O(1), 
where . - 
he, : Ex(k) ~R 


t 


is the canonical height on € [AEC, III §9]. In other words, for any point P € 
E(K)o, the map 
C(k) — R, tr— he,(P), 


is a height function on C(k) corresponding to the divisor [P,P]. Silver- 
man [6] shows that it is possible to choose the height hyp,p) in such a way 
that the difference he,(P;) — hip,pj(t) varies quite regularly as a function 
of t. For example, consider the elliptic surface and section 


E:yY=2°+#(1-#*)z, Pate wy 


Then there is a power series f(z) € R[z] with f(0) = 0 so that for all 
sufficiently large integers t € Z, 


peti shlt) + i lo82 ay (=) 


For details, see Silverman [5,6]. 


Before beginning the proof of Tate’s theorem (11.1), we need to de- 
scribe how to use a height function on the surface € to compute canonical 
heights on the individual fibers. For any integer n and any non-singular 
fiber €;, we will write 

[n]e: Ep EY 


for the multiplication-by-n map on €;. These maps clearly fit together to 
give a rational map on the surface €, 


hE +8, (nl(ay.t) = (Inje(x,y), 2). 


(N.B. Even if the surface € is minimal, the rational map [n] : € — € will 
generally not extend to a morphism.) With these preliminaries completed, 
we are ready for the following lemma. 


Lemma 11.2. Leta: € —C bean elliptic surface defined over a number 
field k, and let hg o) : E(k) + R bea height function on E associated to the 
divisor (O) € Div(€). Then for all t € C(k) such that €; is non-singular, 
and all points (x,y) € €1(k), 


‘ oy 
he (ay) = lim She o)({n](e,9, €))- 
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PROOF. Fix a Weierstrass equation for €, 
E:y=a2°4 Act B, A,B €Ek(C). 
For the moment we will restrict attention to points t € C(k) such that A 
and B are defined at t and A(t) 4 0. Then €; is obtained by evaluating A 
and B at t. 
For each such t we let p; be the map 
pe: Et — FP’, p(x, y) = 2. 


This x-coordinate function on €; has a double pole at O,, which means 
that 
Pp; (Co) = 2(O;). 
We also write ¢; : €, — € for the inclusion of the fiber €; into the surface €. 
Now let (a, y) € €:(k) be any point on a non-singular fiber. We use 
standard properties of the Height Machine (10.1) to compute 


he yoy(2, yy, t) 


= he oy (or(a,y)) definition of ¢4 
= he,.4s(0)(@, y) + O11) functoriality of height (10.1d) 
=he,o,(t,y) + O11) since $7(O) = (Oz) 
1 
= gher2oy(@, y) +O(1) additivity of height (10.1b) 
1 . * 
a 9 Mee we (co)(#1¥) +O(1) since pi (oo) = 2(O;) 
1 
= 5 Pt (oc) (Pe(@,9)) +O(1)  functoriality of height (10.1d) 
1 
= ghz) + O(1) definition of p;. 


It is important to note that in this computation the O(1) bounds will 
depend on t. This dependence arises because we have used the mor- 
phism ¢, : €, — €, and the O(1) in the functoriality property (10.1d) 
depends on the morphism. However, for a given t, the O(1)’s are indepen- 
dent of the point (a,y) € €,(k). To make the dependence visible, we will 
write 


1 : 
heyoy(z, y, t) = gh) + O,(1) for all (x,y) € Ex(k). 
For any point (x,y) € €:(k) and integer n, write [n];(x, y) = (tn, Yn)- 


Then standard properties of the canonical height [AEC, VIJI.9.3b,e] allow 
us to compute 


5 lim, h(a») = tim + he, (Inle(2,y)) + Ox(1)} 


Win jl > 
tim, ne {n*he,(z, y) + O.(1)} 


= he,(a, y)- 
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Now combining these two formulas gives the desired result, 


ns. cl a, 
dim, She coy ([n](z,y, t)) = lim She (o)(tns Ynst) 


. 1/1 1 1 » 
= Jim 72 (Srl) + ox(1)) = 5 im, 7atn) =he,(z,y). 


It is instructive to note that the O,(1) disappears in the limit because the 
“t-coordinate” of [n](x,y,t) is independent of n. 

This completes the proof of (11.2) for all points t € C(k) such that A 
and B are defined at t and A(t) # 0. But by choosing different Weierstrass 
equations for €, we can cover {t € C(k) : €; is non-singular} by finitely 
many such sets. QO 


Proor (of Theorem 11.1). The divisor [P,Q] is determined up to linear 
equivalence by the relation (9.5a), 


(P+ Q) — (P) —(Q) + (0) ~ x*([P,Q)). 


For any point z € €(k) lying on a non-singular fiber €;, we use standard 
properties of the Height Machine (10.1) to compute 


Alp,Q] (t) = Alp.Q| (x(z)) since z € &% 
= hy-(p.g(z) + O(1) functoriality of height (10.1d) 
= h(p+q)-(P)-(@)+(0) (2) + OV) equivalence of heights (10.1c) 
= h(p+ay(2) — hry (2) — hin (2) + hyo) (2) + OC) 
additivity of height (10.1b) 
= hye, .(0)(2) — hee ,(0)(2) — her (oy (2) + oy (2) + O11) 
where Tr: € > E is translation-by-R 
= hwo) (t™~P-@(2)) — ho (7 (2)) — bv (7-@(2)) 
+hyo)(z) + O(1) — functoriality of height (10.1d) 
= ho)(—P: — Q¢ + z) — hioy(-—P: + 2) — hoy (—-Q + 2) 
+hioy)(z)+O(1) — since z € E.(k), so TR(z) = Ry t+ z. 


Note that the O(1) constants appearing in this calculation depend on P 
and Q, but they are independent of z and t = z(z). To indicate this 
dependence, we will write Op.g(1). For each pair of integers 1 <i,j7 <n, 
we evaluate at the point z = iP, + jQ; € E(k) to obtain 


Apa (t) = hyo) ((i — IP + (7 — 1)Q2) — hoy (GE - Pe + 5Q1) 
hoy (tPi + (9G — Qt) + Aco) (@Pi + Qt) + Op.Q(1). 
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Summing this identity over 1 < i,j <n, we find that most of the terms 
telescope, leaving 


n*hyp,q(t) = ho) (NP; + nQt) — hyo) (Pr) — hcoy (nQr) 
+hioy (Or) + Op.g(n’). 


Now dividing by n?, letting n — 00, and using (11.2) yields 


: 1 . 1 
hipai(t) = lim sho (nP, + nQ:) — lim —shio)(nFi) 


: 1 
= jim. 720) (n@e) + Opg(1) 


= he,(P: + Qt) — he,(P:) — he, (Qt) + Op.a(1) 
= (Pi, Qt)t + Op.g(1). 


This completes the proof of Theorem 11.1. Oo 


Taking the limit of (11.1) as the height of t goes to infinity, we can re- 
cover the following result of Silverman which will be used below to prove the 
injectivity of the specialization map. In the special case that the elliptic sur- 
face € > C is split, this result had earlier been proven by Dem’janenko [1] 
and Manin [2]. 


Corollary 11.3.1. (Silverman [1], [7]) Let € — C be an elliptic surface 
defined over a number field k, fix two points P,Q € E(K), and let (P,Q) 
denote the canonical height pairing (4.3, 9.3) of P and Q on E(K). Further, 
let hs : C(k) > R be a height function on C corresponding to a divisor 6 € 
Div(C) of degree 1, and for each t € C(k) such that €; is non-singular 
let (-, +); be the canonical height pairing on €;(k). Then 


(Pe, Qt)t = (P, Q). 


lim 
tEC(k),hs(t)oo hs (t) 


Notice that (11.3.1) applies to all points in E( 4’), not just those in the 
subgroup E (4. )o. It is possible to improve (11.3.1) as described in our next 
result, but we will only give the proof in the case that the base curve C’ 
is Pt. 

Corollary 11.3.2. (Tate [4]) Let € + C, P,Q € E(K), hs, and (-,-); 
be as in (11.3.1). 
(a) Suppose that the base curve C is isomorphic to P!. Then 


(Pi, Qt)t = (P,Q)hs(t) + O(1) fort € P(k) with €, non-singular. 


Notice that in this result we can take hs to be the usual height function 
on P}, 
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(b) For an arbitrary base curve C, we have 
(Pi, Qt) = (P, Q)hs(t) + O(./hs(t)) for t € C(k) with €; non-singular. 


(See exercise 3.34 for the case that C’ has genus 1.) 


We will prove (11.3.1) and (11.3.2) simultaneously. 
Proor (of Corollaries 11.3.1 and 11.3.2). The subgroup E(k) has finite 


index in E(k) from (9.4), so given any two points P,Q € E(K), we can 
find an integer N such that NP,NQ € E(K)o. Note that N depends only 


on P and Q. Further, the canonical height pairings on E() and €,(k) are 
bilinear, so 


(NP, NQt)t = N?7(Pi, Qt)e and (NP, NQ) = N?(P,Q). 


Replacing P,Q by NP,NQ and dividing each of the formulas in (11.3.1) 
and (11.3.2) by N?, we see that. it suffices to prove the two corollaries for 
points P,Q € E(K)o. 

Assuming now that P and Q are in E(K)o, (9.5c) tells us that 


(P,Q) = deg[P, Q]. 
Hence the divisor 
6B = [P,Q] — (P,Q)é € Div(C) 


is a divisor of degree 0. Using (11.1) and the additivity of heights (10.1b) 
yields the estimate 


(Pi, Qt)e = hyp.ay(t) + O(1) = (P, Q)hs(t) + halt) + O11). 


We consider three cases. 
First, to prove (11.3.1), we divide by hs(t) and take the limit 


ha(t) 
he(t) oo hg(t) 


(Ps Qee _ (P,Q) + 


im 
hs(t)hooo hg (t) 


Now (10.2) and the fact that deg(@) = 0 imply that 


ha(t) _ deg 
= = 0) 
ee oe hs(t)  degé i 


which completes the proof of (11.3.1). 
Next, suppose that C = P!. Then @ ~ 0, since two divisors in Div(P!) 
are linearly equivalent if and only if they have the same degree [AEC, II.3.2]. 
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The equivalence property of heights (10.1c) implies that hg(t) is bounded, 
so the above estimate becomes 


(Pi, Qe)t = (P, Q)ha(t) + O(1). 


This finishes the proof of (11.3.2a). 
We will not give the proof of (11.3.2b), other than to say that one uses 
an estimate 


hg(t) = O(Vhs(t)) + O(1), 


valid for divisors G of degree 0. This in turn follows from properties of the 
canonical height on the Jacobian variety of the curve C. For the complete 
proof, see Lang [4, 12 corollary 5.4] or Tate [4]. The special case that C 
has genus 1 is discussed in exercises 3.33 and 3.34. oO 


Let € — C be an elliptic surface. Each point P € E(k) defines a 
morphism op : C' — € which we have been denoting by t > P;. Turning 


this around, we can also say that each point t € C(k) determines a map 
o,: E(K) — €,(k), Pr FP, 


called the specialization map of € at t. If the fiber €; is non-singular, then 
it is clear that the specialization map is a homomorphism, 


o(P+Q) = (P+Q)t = P; +Q: = 01(P) + 04(Q). 


This follows from the fact that on a non-singular fiber €;, the section opig 
is defined by the relation op4Q(t) = op(t) + oQ(t). We now show that for 
“most” values of t the specialization homomorphism is injective. 


Theorem 11.4. (Silverman [1], (7]) Let € — C be a non-split elliptic 
surface defined over a number field k, and let 6 € Div(C) be a divisor of 
positive degree. Then there is a constant c > 0 so that 


o,: E(K) — &4(k) is injective for all t € C(k) satisfying hs(t) > ce. 


(One says that the set of points where o;, fails to be injective is a set of 
bounded height.) In particular, the specialization map o, : E(K) — €,(k) 
is injective for all but finitely many points t € C(k). 


Remark 11.4.1. In the case that the elliptic surface is split, there is a 
version of (11.4) due to Dem’janenko [1] and Manin [2]. Both Dem’janenko 
and Manin used their results to prove the Mordell conjecture (now Faltings’ 
theorem) for certain curves. See exercise 3.16. 
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Remark 11.4.2. There is an earlier result, due to Néron [3] using a Hilbert 
irreducibility argument, which says that if Ck) is infinite, then there are in- 
finitely many ¢ € C(k) for which the specialization map 0; : E(K) — €4(k) 
is injective. This is sufficient for one of the main applications of (11.4), 
namely the construction of elliptic curves of elevated rank over Q or over 
number fields k. The idea is to find an elliptic surface € — C over k for 
which C'(k) is infinite and such that E(/) has high rank. Then specializ- 
ing t € C(k) gives elliptic curves over k of high rank. Néron [3] used this 
procedure to construct infinitely many elliptic curves over Q with rank at 
least 10. More recently, Mestre [2] constructed an elliptic surface € > P! 
defined over Q so that E(Q(t)) has rank at least 12, and Nagao [2] extended 
this result to get rank at least 13. By taking particular values for t € Q, 
it is possible to find specific elliptic curves over Q with even higher ranks. 
See Fermigier [1], Nagao [1], and Nagao-Kouya [1] for examples with ranks 
at least 19, 20, and 21. And the quest continues! 


Proor (of Theorem 11.4). Our assumption that € — C is non-split means 
that the Mordell-Weil theorem (6.1) is valid, so the group E(K) is finitely 
generated. In particular, the torsion subgroup E'(K)tors is finite. 


Let P € E(K) be any non-zero point. Then there are only finitely 
many t € C(k) for which P; = O;, since the two divisors (P) and (O) 
intersect in only finitely many points. This holds for each of the finitely 
many points in F(K’)tors, So we see that on torsion points, the specialization 
map 


04: E(K)tors > €,(k) 


is injective for all but finitely many t € C(k). (In fact, the specialization 
map is injective on torsion whenever €; is non-singular, since the residue 
field k has characteristic 0. This follows from the identification of the 
kernel of the specialization map with the formal group of the elliptic curve; 
see [AEC, IV.3.2b, VI.2.2].) 

Next let P',...,P” € E(K) be generators for the free part of E(K); 
that is, P!,...,P” give a basis for the free group E(K)/E(K)tors. Then 
the non-degeneracy of the canonical height pairing on E(/‘) described 
in (4.3cd) implies that 


det ((P’, P’)) 


1si,jgr ie 


(This is an elementary property of non-degenerate bilinear forms; see Lem- 
ma 11.5 below.) 


Next we specialize the P*’s to the fiber €;, take the height regulator 
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of the resulting Pj’s, and use (11.3.1) to compute 
det ((P?, P} - pi. pi 
ig (( t De)isises = st ( lim ( - 7 Me 
he (t) 00 he(t) alee. AO) fs ganze 
= det ((P*, P)) 
#0. 


Hence there is a constant c so that 


1S<tjsr 


CCC eae ed ere ee 40 for all t € C(k) with hg(t) >. 


It follows from (11.5) and the non-degeneracy of the height pairing on €;(k) 
[AEC, VIII.9.3 or VIII.9.6] that the points P;,..., Pf are linearly indepen- 
dent provided that hs(t) > ce. 

Adjusting c if necessary to account for the finitely many points in 
the torsion subgroup E(K)tors, we have now proven that for all t € C(k) 
with hs(t) > c, both of the specialization maps 


E(K )tors ——s Esk )es and E(K)/E( )tors + Ex (k)/E+(k) tors 
are injective. Now a simple diagram chase using the commutative diagram 


0 — E(K)trs — E(K) — E(K)/E(K)tos — 0 


! l | 


0 —> Ex(k)tors =F E4(k) — vi E(k) /Ee(k) tors — 0 


shows that E(K) — €:(k) is injective, which completes the proof of the 
first part of (11.4). 

The second part is then an immediate consequence of the first part and 
of (10.3) once we observe (10.3.1) that on a curve, any divisor of positive 
degree is ample. Oo 


It remains to prove the elementary property of non-degenerate bilinear 
forms used in the proof of (11.4). 


Lemma 11.5. Let I be a free abelian group, let (.,-) be a positive 
definite bilinear form on T with values in Q, and let x1,...,z, € TP. Then 


X1,-..,2, are linearly independent => det is) ozs 5e3 x 


PROOF. Suppose first that the determinant is 0. This means that there 
are integers a,,...,a,, not all zero, so that 


S> ai(wi, 23) =0 for alll <j <r. 


i=l 
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Multiplying by a;, summing over j, and using the bilinearity gives 


In other words, y = S> a,x; €T satisfies (y, y) = 0, so the positivity of the 
bilinear form implies that y = 0. Hence the x;’s are linearly dependent. 
Conversely, if the x;’s are linearly dependent, say S> a;x2; = 0, then 
the linearity of the pairing implies that the rows of the matrix (aes, a5)) 
are linearly dependent, so the determinant is zero. oO 


§12. Integral Points on Elliptic Curves over Function Fields 


There is a theory of S-integral points on elliptic curves over function fields 
which is completely analogous to the theory over number fields as described 
in [AEC, Ch. IX]. However, for function fields it is possible to prove much 
stronger results using relatively elementary techniques. In this section 
we will give a short and elegant proof of the analogue of Siegel’s theo- 
rem [AEC, IX.3.2.1] which asserts that an elliptic curve has only finitely 
many S-integral points. We will also state and briefly sketch the proof of 
an effective version of this result. 

The simplest function field analogue of integral points are “polynomial 
points.” Thus let £/k(T) be an elliptic curve over a rational function field, 
say given by a Weierstrass equation 


E:y=a23+A(T)x+B(T) — with A(T), B(T) € kIT]. 
The the set of polynomial points of EF is the set 
{P =(a,y) € E(k(T)) : x,y € kT]. 
For example, if 
E:y=2°-T*z+T? and P=(T,T), 

then (1.1.1) says that P and 2P are polynomial points, but 3P is not. 

One way to characterize the polynomial ring k[T] is to observe that it 
is the subring of k(T) consisting of functions with no (finite) poles. More 


generally, we define the ring of S-integers of an arbitrary function field in 
the following way. 


§12. Integral Points on Elliptic Curves over Function Fields 275 


Definition. Let kK = k(C) be the function field of a curve, and let SCC 
be a non-empty finite set of points of C. The ring of S-integers of K is the 
ring 

Rs ={f €K : ord;(f) > 0 for all t ¢ S}. 
Here ord,(f) is the order of vanishing of f at t; see [AEC, II §1]. 


The following function field analogue of Siegel’s theorem is a special 
case of a result of Lang [8], who proved a general finiteness theorem for 
integral points on curves of arbitrary genus over function fields. 


Theorem 12.1. Let kK = k(C) be the function field of a curve, let S CC 
be a non-empty finite set of points of C, let E/K be an elliptic curve that 
does not split over k, and let F € K(E) be a non-constant function on E. 
Then 

{Pe E(K) : F(P)€Rs} 


is a finite set. 


ProoF. Our first observation is that it suffices to prove (12.1) for the 
special case that F’ is taken to be the x-coordinate on some Weierstrass 
equation for £/K. The reduction from the general case to this special case 
is given for number fields in [AEC, IX.3.2.2], but the proof is the same for 
function fields. So we are reduced to showing that 


{Pe E(K):a(P)€Rs} 


is a finite set. 
If P is a point in this set, then the height of P is a sum of local 
contributions coming from the points in S. More precisely, we have 


RP) Shieh) definition of h(P) 
= pi max{—ord;(2(P)),0} from (4.1) 
teC 
= > max {- ord;(a(P)),0} — since ord:(2(P)) > 0 fort ¢ S$ 
tes 


< #5 - max{—ord(x(P)) }. 


Further, our assumption that FE does not split combined with (5.4) tells 
us that £(/<) has only finitely many points of bounded height. Hence the 
following result (12.2) completes the proof of (12.1). oO 


Theorem 12.2. (Manin [3]) Let K = k(C) be the function field of a 
curve, let t € C be any point of C, and let E/K be an elliptic curve that 
does not split over k, say given by a Weierstrass equation 


Bsa a? fh Ag 
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Then the function ord;(a(P)) is bounded below as P ranges over E(K). 


Before giving the proof of (12.2), we want to observe just how strong 
a statement it is. For example, the number field analogue of (12.2) is 
certainly false. Thus, if we let p € Z be a prime and E/Q be an elliptic 
curve with a rational point 


P € E(Q) satisfying ord,(x(P)) <0, 
then consideration of the formal group of E(Q,) shows immediately that 
ordy(x(p"P)) + —co as n > 00. 


The following short proof of (12.2), which is due to Voloch [1], uses 
the formal group and depends crucially on the fact that the base field k 
has characteristic 0. 


PROooF. (of Theorem 12.2, Voloch [1]) We may replace the constant field k 

by its algebraic closure, since this will only have the effect of making E(K) 

larger. Further, replacing x by u?xz for some u € K*, we may assume that 

the given Weierstrass equation is a minimal equation for the valuation ord,. 
For each integer n > 1, let 


E,(K) = {P € E(K) : ord,(a(P)) < —2n}. 


This is the standard filtration on the formal group of EF, see [AEC, Ch. IV]. 
The crucial facts to note here are that each E,,(K) is a subgroup of E(k), 
and each quotient group E,,(K)/E,41(K) is isomorphic to a subgroup of k. 
To see this, let K; be the completion of the field AK for the valuation ord, 
let R; be the ring of integers of Ky, let MM; be the maximal ideal of R;, and 
let E be the formal group of E/K;. It follows from [AEC, IV.3.1.3] that 
En( Ky) & E(M®), and then [AEC, IV.3.2(a)] tells us that 


En(Ky)/En+i(Ke) & EOMP)/ BME *") = Mp /Met & k. 


(The last isomorphism uses R:/M, & k.) Now the fact that E,(K) = 
E(K) 0 E,,(K;,) immediately implies our two assertions that E,(K) is a 
subgroup of E(k) and E,,(K)/E,41(K) is isomorphic to a subgroup of k. 
In particular, our assumption that & has characteristic zero means that the 
quotient groups E,(K)/En+41(K) have no elements of finite order. 

Suppose now that ord;¢(z(P)) is not bounded below on E(K). Then 
we can choose a sequence of points P;, P2,... with 


ord; (2(Pi)) = —2n; and O<n <ng<ng<-:-. 


We claim that the points P,, P2,... are linearly independent, which will 
contradict the Mordell-Weil theorem (6.1) and thus complete the proof 
of (12.2). 
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Suppose to the contrary that P,, P2,... are linearly dependent. Dis- 
carding the first few P,’s if necessary and relabeling, we may assume that 
there is a relation 


m,P, +m2P2+-+:++m,P, =O with m, £0. 


Using the fact that ny < ng <--- <n, and that the F,(K)’s are nested 
subgroups of F(A’), we see that 


m,P, = —m2P) —--:—m,P, € En, (K), som, P, € En, 41(K). 


But as noted above, the quotient E,,(K)/En,41(4) is isomorphic 
to a subgroup of k, and & is a field of characteristic zero. So the fact 
that m,P, is zero in this quotient implies that P; itself is zero in the 
quotient. In other words, P; € Ey,,41(4), which contradicts the fact that 
ord;(z(P,)) = —2n,. Hence the P;’s are independent. oO 


The proofs of (12.1) and (12.2) are ineffective because they depend on 
the Mordell-Weil theorem (6.1). Notice that (12.2) is analogous to Siegel’s 
theorem [AEC, IX.3.1], although (12.2) is both stronger and considerably 
easier to prove. 

Similarly, one can prove effective bounds for S-integral points in the 
function field case which are analogous to the bounds provided by linear- 
forms-in-logarithms methods for number fields [AEC, 85]. Again the func- 
tion field estimates are stronger and much easier to prove. A number of 
people have given such bounds, including for example Schmidt [1], Ma- 
son [1], and Hindry-Silverman [2]. We will briefly sketch the proof of the 
following version. The argument is the same for elliptic or hyperelliptic 
curves, so we give the more general case. 


Theorem 12.3. Let K = k(C) be the function field of a curve C' of 
genus g, let S C C be a non-empty finite set of points of C, let Rg be the 
ring of S-integers of K, and let f(x) € Rs|x] be a monic polynomial with 
discriminant A satisfying A € R%. Suppose that x,y € Rg satisfy 


y= a): 


Then 
h(y*/A) < 4n(n — 1) max{2g — 2+ #5, 0}. 


(Recall from §4 that the height of an element f € K is defined to be the 
degree of the map f : C — P!. Also note that the set S has to be chosen 
large enough to contain all of the zeros and poles of A.) 


Remark 12.3.1. The bound in (12.3) is stated for y+/A because this 
quantity is invariant under linear change of variables. However, it is easy 
to use (12.3), the relation y? = f(x), and elementary properties of height 
functions to give a bound for h(x) in terms of the coefficients of f. See 
exercise 3.39 for the particular case of an elliptic curve. 
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Proor. (Sketch of Theorem 12.3) The first step is to reduce the problem 
of S-integral points on elliptic curves to the problem of solving the S-unit 
equation 


utve=l1, u,v € RG. 


This reduction procedure is due to Siegel and is described in [AEC, IX.4.3]. 
Next, one proves that if u,v € R% satisfy u+v = 1, then 


h(u) < max{2g — 2+ #5) 0}. 


This bound is the function field analogue of the abc-conjecture of Masser 
and Osterlé. There are several elementary proofs available; see for exam- 
ple Mason [1], Silverman [8], or Vojta [1]. Tracing back through Siegel’s 
argument gives the estimate described in (12.3). We leave the details to 
the reader, or see Hindry-Silverman [2, Prop. 8.2]. Oo 


EXERCISES 


3.1. For any pair (a, 6), let Ea.» be the curve 
Eae iy? tary + by = 2° + ba?. 


Notice that the point (0,0) is on each of these curves. 

(a) Let E/k be an elliptic curve, let P € E(k) be a point, and assume that 
P,2P,3P # 0. Prove that FE has a Weierstrass equation of the form Ea» 
with P corresponding to (0,0). (Hint. Move P to (0,0), rotate so that the 
tangent line at (0,0) is the z-axis, and make a dilation to get az = a3.) 
(b) Prove that 5P = O if and only if a = 6+ 1. Conclude that every 
elliptic curve E/k with a point P of exact order 5 is isomorphic to some 
fiber of the elliptic surface 


E:y? + (t+ lacy t+ ty = 2% 4 ta? 


by an isomorphism taking P to the point (0,0) on that fiber. 


(c) More precisely, if E is defined over k and if P € E(k) is a point 
of exact order 5, prove that there is a unique point to € P’(k) and an 
isomorphism ¢: E — €¢, defined over k such that ¢(P) = (0,0). 

(d) Using a similar construction, find an elliptic surface € — P’ which 
classifies elliptic curves with a given point of order 7. 
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3.2. 


3.3. 


3.4. 


3.5. 


3.6. 


Let C/Q be the (elliptic) curve 
C:s?-s=t-?t, 
let E/Q(C) be the elliptic curve 
2 2 Di a8 2 
E:y' 4+(st+t—s*)ry+s(s—1)(s—t)t'y=2 + s(s—1)(s—t)ta’, 


and let P = (0,0) € E(Q(C))[11] be the point of order 11 as described 
in (1.1.3). 

(a) Let A be an elliptic curve and Q € A a point of order 11. Prove that 
there is a unique point (so, to) € C such that if we substitute (s,t) = (so, to) 
into the equations for E and P, then we obtain an elliptic curve Eo and a 
point Po € Eo of order 11 such that there is an isomorphism ¢: A — Eo 
satisfying ¢(Q) = Po. 

(b) If A is defined over k and Q € E(k), prove that the point (so, to) 
obtained in (a) will lie in C(k). 

This exercise gives a function field analogue of [AEC, exercise 8.1]. Let C/k 
be a curve of genus g with function field K = k(C), and let E/K be an 
elliptic curve which does not split over kK. 

(a) Suppose that E/K has a Weierstrass equation of the form 


E:y? =(a#—1)(a — e2)(a — es) with €1,e2,e3 € K. 


Let S C C be the set of points where any one of €1, €2, e3 has a pole together 
with the points where the product (e1 —e2)(e1 —e3)(e2 —e3) vanishes. Prove 
that 

rank E(k) < 4g + 2#S — 2. 


(b) Suppose that E/K has a Weierstrass equation of the form 
E:y=2°+Ac+B with A,BeK. 


Let S CC be the set of points where A or B has a pole together with the 
points where A = 4A® + 27B? vanishes. Find an explicit bound for the 
rank of E(4) in terms of g and #S. 


Let k/Q be a finitely generated field extension, that is, k = Q(a1,...,ar) 
for some a1,...,a, € C. Let C/k be a curve with function field K = k(C), 
and let E/K be an elliptic curve. Prove that E(K) is a finitely generated 
group. In particular, this is true if k is a number field. (Hint. You may 
find exercise 3.15 below useful for doing this problem.) 


Let A be an abelian variety of dimension one, so in particular A is a non- 
singular projective curve. (See §2 for the definition of abelian variety.) 
Prove that A has genus 1, so A is an elliptic curve. 


Let 6: V — W bea rational map of projective varieties. 

(a) Prove that the image ¢(V) is an algebraic subset of W. 

(b) Suppose that V is non-singular. Prove that the set of points where ¢ 
is not defined has codimension at least two in V. 
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3.9. 


3.10. 


3.11. 
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Recall that a topological space X is irreducible if it cannot be written as a 
union X = X; U X2 of non-empty closed subsets of X. 

(a) Let X be a topological space, and let Z C X be a subset taken with 
the induced topology. If Z is irreducible, prove that the closure of Z in X 
is also irreducible. 

(b) Let ¢: X — Y be a continuous map of topological spaces. If X is 
irreducible, prove that its image @(X) is irreducible. 

(c) Use (a) and (b) to deduce the second part of Proposition 3.5(a). 


Let € — C be an elliptic surface over k. Define a map 

jesC —P*, — je(t) = j(Es). 
More precisely, define je(t) to be the j-invariant of the elliptic curve €: 
provided that the fiber €; is non-singular, and for the moment leave it 
undefined for the remaining points of C. Prove that je is an algebraic 
map, and conclude that it extends to a morphism from C to P?. 
Let € — C be an elliptic surface over k, and let jg : C — P? be the 
morphism defined in exercise 3.8. 
(a) If € > C splits over k, prove that je is a constant map. 
(b) Give an example (with proof) of an elliptic surface € — C that does 
not split over k for which je is a constant map. 
Let € — C be an elliptic surface over k, and let jg : C — P' be the 
morphism defined in exercise 3.8. Choose a Weierstrass equation 


E:y? +aizy t+ asy = 2° + aex? + az + a6 


for €, where a1,...,a6 € k(C), and let c4, ce be the usual associated quan- 
tities [AEC III §1]. Prove that € — C splits over k if and only if one of the 
following three conditions is true. 

(i) ge(C) = {0} and cg is a sixth power in k(C). 

(ii) ge(C) = {1728} and cq is a fourth power in k(C). 

(iii) je(C) = {a} with a 4 0, 1728, and ce /ca is a square in k(C). 

(Keep in mind we are assuming that char(k) = 0, although this exercise 
remains true if char(k) > 5.) 

Let £/K be an elliptic curve defined over a function field K = k(C) by an 
equation of the form 


E:y=a°+Ac+B. 
Further define the “height of E” to be h(E) = h( A?) + h(B’). 
(a) Prove that for all P € E(k), 
4h(P) — 3h(E) < h(2P) < 4h(P) + h(E). 


(b) Prove that for all P,Q € E(4), 
4n(E) < h(P+Q) + h(P — Q) — 2h(P) — 2h(Q) < h(E). 


(c) Prove that for all P € E(K), 


5 


h(P) — 5h(P) < gh(Z). 


(In all three parts of this problem, the constants in front of the h(E)’s are 
far from best possible. See how much you can improve them.) 
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3.12. 


3.13. 


3.14. 


3.15. 


3.16. 


Let E/C(T) be the elliptic curve 
E:y? =a? —(T?+T)x+T”. 
Find all points in E(C(T)) of the form 
P= (ao + a1T, bo + 61T, co + C1 T] 


by substituting P into the equation for FE and solving for ao,...,c1. How 
many of these points are defined over Q(T)? 


With notation as in the statement of Lemma 5.5.2, prove that if 


deg D> 9+ 5d+ 5 deg A + 5 deg B, 


then the image of Vp in E(K) contains E(K,d). This provides a strength- 
ened version of (5.5.2). 


Let V be an irreducible projective variety defined over an algebraically 
closed field, and let y1,-y2 € V be distinct points. Prove that there exists 
an irreducible curve [. C V with 71,72 €T. 


This exercise describes the Mordell-Weil theorem (6.1) for split elliptic 
surfaces. Let C’/k be a curve, let Eo/k be an elliptic curve, and let € = 
Eo x C be the corresponding split elliptic surface. Further, let Map,(C, Eo) 
be the set of morphisms from C' to Eo defined over k. We use the group 
structure on Eo to make Map,(C, Eo) into a group in the usual way, (¢+ 
w)(t) = o(t) + W(t). Notice that the constant maps in Map,(C, Eo) form a 
subgroup isomorphic to Eo(k). 

(a) Prove that there is a natural isomorphism €(C/k) = Map,(C, Eo). In 
particular, the group of sections €(C’/k) contains a subgroup isomorphic 
to Eo(k). 

(b) Prove that the quotient group €(C/k)/Eo(k) is finitely generated. 

(c) If k is a number field, prove that €(C’/k) is finitely generated. 


Let k be a number field, and let C/k, Eo/k, € = Eo x C, and Map, (C, Eo) 
be as in the previous exercise. 

(a) Let 6 € Div(C) be a divisor of degree 1, and fix a height function hs : 
C(k) — R associated to 6. Prove that for any map ¢ € Map,(C, Eo), 


lim hEo (o(t)) 
tEC(k),hg(t)+00 he (t) 


= deg(¢). 


(b) Fix a basepoint to € C(k). Prove that there is a constant c so that 
if ¢ € C(k) satisfies hs(t) > c, then the map 


{¢ € Map,(C, Eo) : ¢(to) =O} —> Eo(k) 
co) — ot) 


is an injective homomorphism. 
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3.18. 


3.19. 


3.20. 
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(c) Suppose that the quotient group Map,(C, Eo)/Eo(k) has free rank r, 
say generated by the maps ¢1,...,¢r : C — Eo. Suppose further that the 
group E(k) has rank strictly less than r. Prove that C(k) is finite. 

(d) Fix an element b € k*, and let C/k and Eo/k be the curves 


C:X°+y*% =0Z®, Eo: y?z = 2° + be”. 


Prove that the group Map, (C, Eo)/Eo(k) has rank at least two by showing 
that the maps 


di ([X, Y, Z]) = [-—X?Z,Y°, 2°), @o((X Zi) (AY, ko 


give independent elements. Use this to prove that if rank Eo(k) < 1, then 
C(k) is finite. 

Let E be an elliptic curve defined over Q(T) that does not split over C(T). 
(a) Suppose that ki/Q and k2/Q are fields with the property that 


E(ki(T))=E(C(T)) and ~—-E(ke(T)) = E(C(T)). 


Let k = ki Oka. Prove that E(k(T)) = E(C(T)). Deduce that there is a 
smallest field with this property. We will call this field the field of definition 
for E(C(T)) and will denote it by ke. 

(b) Prove that kg is a finite extension of Q. 

(c) More precisely, find an explicit bound for the degree [kz : Q] that 
depends only on the rank of E(C(T)). 

(d) *Fix a Weierstrass equation for E of the form 


E:y=a2° + A(T)£+ BT) 


with A(T), B(T) € Z[T], and let A(T) = 4A(T)? + 27B(T)?. Prove that 
the extension kz/Q is unramified except possibly at 2, 3, and the primes 
dividing the discriminant of the polynomial A(T) € Z[T]. 
(a) Let f € k(P?). Prove that deg(div(f)) = 0, and deduce that the 
degree map deg : Pic(P?) — Z described in (7.1) is a well-defined homo- 
morphism. 
(b) Prove that the degree map deg : Pic(P?) — Z is an isomorphism. 
(c) Generalize (a) and (b) to P”. 
Let P = (0,0) € A’, and let f1, fo € k(P”) be rational functions satisfying 
fi(P) = fe(P) = 0. Let Ty and [2 be the curves fi = 0 and fe = 0 
respectively. Prove that I; and 2 intersect transversally at P if and only 
if the following three conditions are true: 

(i) Ty is non-singular at P; 

(ii) Pz is non-singular at P; 
(iii) the tangent line to; at P is distinct from the tangent line to I’ at P. 
For each of the following curves 1,2 C P?, calculate the local intersection 
index ([, -T'2)p at the point P = (0,0, 1]. (Hint. First dehomogenize by 
setting Z = 1.) 
(a) Ty: ¥?Z =X? 4 X?Z, T2:Y 
(b) Ty: ¥?Z= X°4 X7Z, T2:Y 
(c) Ty /Y?Z = X?, T2:Y 
(d) 11: ¥Y°Z=X°4+X°Z, T2:Y 
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3.21. 


3.22. 


3.23. 


3.24. 


3.25. 


“Let 7: S — C bea fibered surface. Prove that every fiber S; is connected. 
(You may want to take k = C.) 

Let 7: S > C bea fibered surface, let T C S be a non-singular irreducible 
horizontal curve, and let 6: — C be the restriction of a to I. 

(a) Let P€T andt = ¢(P) € C. Prove that 


(C . m*(t))p = ep(¢), 


where ep(¢) is the ramification index of ¢ : Tf — C at P (see [AEC, II §2]). 
(b) Prove that [- 2*(6) = deg(#) deg(6) for all divisors 6 € Div(C). 

(c) *Prove that [- 7*(6) = deg(¢) deg(6) remains true even if the irre- 
ducible horizontal curve [' is allowed to be singular. 

Let 7 : S — C be a fibered surface, and let 6 € Div(C) be a divisor of 
degree 0. Prove that D-7*6 = 0 for every divisor D € Div(S). (Hint. 
Use (8.1) and the previous exercise.) 

Generalize (8.3.1) as follows. Let z : S — C be a fibered surface, and 
suppose that the fiber S; consists of m components arranged in the shape 
of an n-gon with transversal intersections. In other words, 


1 ifi =j+1(modn), 
w(t) =Potlit+---4+@Pn-1 with T;-T; =< —2 ifi=j, 
0 otherwise. 


(a) Draw a picture illustrating this fiber, and show that the self-intersec- 
tion values T? = —2 follow from the values of [; - I; for i # 7. 

(b) Let J = (Ti - Ty)o<ij<n—1 be the incidence matrix of the fiber, and 
let Joo be the minor obtained by deleting the first row and column from I. 
Find the value of det(Joo) in terms of n. 

(c) Let k be an integer between 0 and n — 1, and let D € Div(S) be a 
divisor satisfying 


D-To=-1, D-Ty=1, D-T; =0 fori40,k. 


Find a fibral divisor 


n-1 
Pp = Sail such that (D+ ®p)-T; = 0 for allO<i<n-1; 


i=1 


that is, find an explicit formula for a; in terms of 7, k, and n. 


Let 7: S —C be a fibered surface. 

(a) Let I € Div(S) be an irreducible fibral divisor. Prove that there exists 
an irreducible horizontal divisor D € Div(S) satisfying D-T > 0. 

(b) Prove that there exists a horizontal divisor D € Div(S) with the prop- 
erty that D-I > 0 for every irreducible fibral divisor T € Div(S). 

(c) Let D € Div(S) be a divisor as in (b), and let t € C. Use the Nakai- 
Moishezon criterion (10.3.2) to prove that the divisor D + nx*(t) is ample 
for all sufficiently large integers n. 
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3.27. 


3.28. 


3.29. 


3.30. 
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Let 7: € + C be a minimal elliptic surface with associated elliptic curve 
E/K. Define an action of E(k) on Div(€) by having P € E(K) send a 
divisor D to the divisor tp(D). 

(a) Prove that the action of E(K) on Div(€) descends to give a well-defined 
action of E(k) on Pic(€). 

(b) For P,Q € E(K), let ®pg € Div(E) be a fibral divisor with the 
property (P + Q) — (P) — (Q) + (O) ~ ®pag. (See (9.5).) Prove that for a 
fixed Q € E(K), the map 


E(K) — Pic(€), P+ class ®p.g, 


is a one-cocycle from E(k) to Pic(€), where Pic(€) is an E(K)-module as 
described in (a). (See (9.5) for a stronger result for E(K)o.) 
Let P € E(K). Prove that P € E(K)o if and only if (P)- F = (O)- F for 
every fibral divisor F € Div(€). 
Let 7: S — C be a fibered surface. 
(a) Let [I Cc S be a curve with the property that 7 : T — C is an iso- 
morphism. Prove that [- 1*(t) = 1 for allt € C. Use this to deduce 
that 

I’. *(6) = deg(6) for all divisors 6 € Div(C). 


(b) Fix a point t € C. Prove that the image of a section a :C > S 
intersects exactly one component of the fiber S;. 

(c) Let F C Div(S) be a fibral divisor, and suppose that F = 7*6 for 
some 6 € Div(C)@Q. Suppose further that there exists asectiong : C — S. 
Prove that 6 € Div(C). 

(d) Let 6 € Div(C) be a divisor such that 7*6 is a principal divisor on €. 
Prove that 6 is a principal divisor on C. Deduce that 7 induces an injective 
homomorphism 7* : Pic(C’) — Pic(S). (This is easier if you assume that 
there exists a section ¢: C > S.) 

Let 7: € — C be an elliptic surface, let E/K be the associated elliptic 
curve over the function field K = k(C) of C, and let Pi,...,P, € E(K). 
Prove that if 


ni(Pi) +--+ +n,(P,) — n(O) € Div(€) 
is linearly equivalent to a fibral divisor, then 
[ni] Pi feet [ny |P, = O. 


This gives the converse to Proposition 9.2. 


Let S be a non-singular surface, let D € Div(S) be a divisor, and let H € 
Div(S) be an ample divisor. 

(a) Use the Nakai-Moishezon criterion (10.3.2) to prove that the divi- 
sor nH + D is ample for all sufficiently large integers n. 

(b) Use (a) to prove (10.5) for surfaces; that is, prove that D can be written 
as the difference of two very ample divisors on the surface S. 


Exercises 285 


3.31. 


3.32. 


3.33. 


Let V/k be a non-singular projective variety defined over a number field, 
let D € Div(V) be a positive divisor, and let hp : V(k) — R be an 
associated height function. (A divisor is positive if it can be written as a 
sum So niDi, where the D,;’s are irreducible subvarieties of V and the n;’s 
are positive.) 

(a) Prove that there is a constant c = c(V, D,hp) such that 


hp(P)>c for all PE V(k), P¢ D. 


(Note that c need not be positive.) 

(b) Give an example to show that (a) need not true for all P € V(k). 
(Hint. Take D to be the exceptional curve on P? blown up at one point. A 
harder example is to let D be the diagonal in C x C, where C is a curve of 
genus g > 2.) 


Let V/k be a non-singular projective variety defined over a number field, 
and let D,H € Div(V) be divisors with H ample and D algebraically 
equivalent to zero (see Hartshorne [1, exercise 1.7]). Prove that 


hp(P) 


lim = 0. 
PEV(k).hy(P)>0 ha (P) 


(This generalization of (10.3) is due to Lang.) 


Let E/k be an elliptic curve defined over a number field k. For any divi- 
sor 8 = 5~b,(P;) € Div(E), we define the canonical height hg associated 
to 2 by the formula 


he(P) = > bih(P — P,). 


Thus the usual canonical height h is the height associated to the divi- 
sor (O). 
(a) If the divisor 6 is symmetric, that is, [-1]*@ = 6, prove that 


Es ; 1 
ha(P) = lim —sha([n]P). 

(b) If the divisor @ is anti-symmetric, that is, [-1]*G = —G, prove that 
. 1 
hg(P) = lim |, hallr|P). 


Also prove in this case that the map hg : E(K) — R is a homomorphism. 
(c) Let B € Div(E) be a divisor of degree 0. Prove that there is a con- 
stant c = c(E, 8) so that 


lha(P)| <e\/h(P) for all P € E(k). 
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Let € — C be an elliptic surface defined over a number field k, and fix two 
points P,Q € E(k). Suppose that the base curve C is an elliptic curve, and 
let hc be the canonical height on C. With notation as in (11.1) and (11.3), 
prove that 


(Po. Que = (P,Q)ho(t) + O(y/he(t) ) 
for all t € C(k) with €; non-singular. 
This is a special case of (11.3.2b). 
Let K = k(C) be the function field of a curve over an algebraically closed 


field k, let E/K be an elliptic curve, and choose a Weierstrass equation 
for E/K of the form 


y =2°+Ar+B for some A, Be K. 
Let A = 4A® + 27B?, and for each point t € C let 


min{ord;(A?), ord:(B?)} 
12 


mt = ord;(A) — 12 


(Here [r] is the greatest integer in r.) Define the minimal discriminant 
divisor of E/K to be the divisor 


DeyK = Si ni(t) E Div(C). 
tEeC 


(This is the analogue of the minimal discriminant ideal for an elliptic curve 
defined over a number field [AEC, VIII §8}.) 


(a) Prove that Dx is a positive divisor and that it is independent of the 
choice of the Weierstrass equation for E/K. 

(b) Let € — C be a minimal elliptic surface associated to E. Prove that 
the fiber €; is non-singular if and only if ord:(Dz;x) = 0. 

(c) Prove that if Dg;~ = 0, then the j-invariant j(E+) is constant. 

(d) Prove that if Dg;~ =O and C = P?, then € splits over C. 

(e) Let C/k and E/K be given by 


C:v? =u*—7u’ + bu, E:y? =2° — Tuc + 6u’. 


Prove that Dg;« = 0 and that the associated elliptic surface E — C does 
not split. 

(f) For the example in (e), show that P = (u?,uv) € E(K) is a point of 
infinite order. 
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3.37. 


3.38. 


We continue with the notation from the previous exercise, with the addi- 
tional assumption that E does not split over K. Let g be the genus of C. 
For each point t € C, we take a Weierstrass equation for F that is minimal 
for the valuation ord; and we let E; be curve obtained by evaluating the 
coefficients at t. We define integers f; by 


1 if E; has a node (multiplicative reduction), 


0 if FE, is non-singular (good reduction), 
fe= 
2 if E; has a cusp (additive reduction). 


Then the conductor divisor of E/K is defined to be the quantity 


fey =~ filt) € Div(C). 
tec 
(For another description of the conductor, see ([V §10).) 
(a) Prove that ordi (fzsx) < ord:(De/x) for all t € C, and deduce that 


deg(fzsx) < deg(De x). 


(b) Prove that 
deg(De/x) < 6deg(fz/x) + 12(g — 1). 


This inequality is a precise function field analogue of Szpiro’s conjecture 
([V.10.6). It was originally discovered by Kodaira (see Shioda [3, Proposi- 
tion 2.8]) long before Szpiro formulated his conjecture. 


“This exercise is the function field analogue of Lang’s conjectural lower 
bound for the canonical height [AEC, VIII.9.9]. We continue with the 
notation from the previous two exercises. Let g be the genus of C, and 
let h: E(K) > R be the canonical height on E (4.3, 9.3). 
(a) Prove that there is a constant ci(g) > 0, depending only on g, such that 
if P € E(K) is a point of infinite order, then h(P) > ci(g) deg Desk. 
(b) Prove that there is an absolute constant cz > 0 so that if deg Dz/x > 
2g—2 and if P € E(K) is a point of infinite order, then A(P) > cedeg Desk. 
Let C be a non-singular hyperelliptic curve of genus two given by an equa- 
tion 

C:y? = ax” + br + cx® + dx® + ex + f, 


let i: C — C be the involution i(z, y) = (x, —y), and let Po € C be the 
point at infinity on C. (See [AEC, 1.2.5.1 and exercise 2.14] for basic facts 
about hyperelliptic curves.) 

(a) Prove that i(Po) = Po. Find all other points satisfying i(P) = P. 

(b) Let P,Q € C be any two points. Prove that the divisors (P) + (i(P)) 
and (Q) + (i(Q)) are linearly equivalent. 

(c) Let D € Div°(C). Prove that there exist points P,Q € C such that 
D ~ (P) + (Q) — 2(Po). 

(d) Prove that the points P and Q in (c) are uniquely determined by D 
unless P = i(Q). 

(e) Prove that Pic(C)[2] is finite. More precisely, prove that it is isomor- 
phic to (Z/2Z)*. 

(f) Generalize (a)-(e) to the case of a hyperelliptic curve C : y? = f(x) of 
genus g, where f(z) is a polynomial of degree 2g + 1 with distinct roots. 
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Let K = k(C) be the function field of a curve C of genus g, let SC C bea 
non-empty finite set of points of C, let Rs be the ring of S-integers of K, 
and let E/K be an elliptic curve given by a Weierstrass equation 


E:y=2°+Ar+B ~ with A,B e€ Rs and A = 4A° 4 27B7 © R¥. 


Let P € E(K) be a point satisfying z(P), y(P) € Rs. Prove that 
h(P) = h(a(P)) < 4max{2g — 2+ #5,0} + a(n) + h(A®) + h(B’)). 


(Hint. Use (12.3) and elementary properties of height functions.) 


Let € — C be a minimal elliptic surface, let E/K be the associated elliptic 
curve, and let E(K’)o be the subgroup of E(K) described in §9. 

(a) Suppose that € has non-constant j-invariant; that is, the function 
je: C — P' defined in exercise 3.8 is non-constant. Prove that E(K)o has 
no non-trivial torsion. 

(b) For each integer m € {2,3,6}, give an example of a non-split elliptic 
surface (necessarily with constant j-invariant) such that E(4)o contains a 
point of exact order m. Prove that these are the only orders possible. 


CHAPTER IV 


The Néron Model 


Let R be a discrete valuation ring with maximal ideal p and fraction field Kk, 
and let E/K be an elliptic curve given by a Weierstrass equation 


BE: y? + ayzy + azgy = x? + agu? +a4r + a6, 


say with coefficients a1, a@2,@3,a4,ag € R. This equation can be used to 
define a closed subscheme W Cc P%. An elementary property of closed 
subschemes of projective space says that every point of E(i) extends to 
give a point of W(R), that is, a section Spec(R) — W. 

An important property of the elliptic curve F is that it has the struc- 
ture of a group variety, which means that there is a group law given by 
a morphism E x FE — E. This group law will extend to a rational map 
W xr W > W, but in general it will not be a morphism, so W will not be 
a group scheme over R. However, if we discard all of the singular points on 
the special fiber of W (i.e., the singular points on the reduction of EF mod- 
ulo p) and call the resulting scheme W®, then we will prove that the group 
law on E does extend to a morphism W® xp W° — W°. This makes W® 
into a group scheme over R, but, unfortunately, we may have lost the point 
extension property. In other words, not every point of E(K) will extend 
to give a point in W°(R). 

A Néron model for E/K is a scheme €/R which has both of these 
desirable properties. Thus every point in E(k) extends to a point in €(R), 
and further the group law on E extends to a morphism E x rp E > € which 
makes € into a group scheme over R. It is by no means clear that such 
a scheme exists. Our main goals in this chapter are to construct Néron 
models, describe what they look like, and give some applications. 

The material in this chapter is of a more technical nature than most of 
the rest of this book. We will assume that the reader has some familiarity 
with basic scheme theory, as described for example in Hartshorne [1, Ch. IT] 
or Eisenbud-Harris [1]. When we need more advanced material, we will give 
at least a brief explanation together with a reference for further reading. 

We begin in $1 with a brief discussion of group varieties. This ma- 
terial is not strictly necessary for the remainder of the chapter but may 
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prove helpful for those readers who have not studied group schemes pre- 
viously. Section 2 contains some basic material on abstract schemes and 
schemes over a base S, including material on fiber products, special fibers, 
regularity, properness, and smoothness. In §3 we define group schemes 
and describe some of their elementary properties. Section 4 is devoted to 
the theory of arithmetic surfaces. An arithmetic surface C over a discrete 
valuation ring R is a “nice” scheme whose generic fiber is a non-singular 
curve C/K. We give several examples and prove that the smooth part @° 
of a regular proper arithmetic surface C has the point extension prop- 
erty C(K) = ©°(R). We also state the fundamental existence theorems 
concerning minimal regular models of arithmetic surfaces. 

In §5 we define Néron models and show that the smooth part W? 

of a Weierstrass equation is a group scheme. In some cases, for example 
when E/K has good reduction, this will imply that W® is a Néron model 
for E/K. The general construction of Néron models is given in §6, where 
we prove that the smooth part €/R of a minimal proper regular model C/R 
for E/K is a Néron model. The proof is quite technical and may be omitted 
on first reading. This is especially true for those readers who are mainly 
interested in applications of the theory of Néron models. Frequently, it is 
enough to know that a Néron model for E'/K is a group scheme E over R 
whose generic fiber is K and which has the property that €(R) = E(K). 
_ We next take up the question of what the special fibers é (mod p) and 
€ (mod p) look like. Section 7 contains a discussion of intersection theory 
on general arithmetic surfaces, and then in §8 we apply this theory to give 
the Kodaira-Néron classification of the special fibers of an elliptic fibration. 
Section 9 contains a description and verification of an algorithm of Tate 
which gives an efficient method of computing the special fiber C (mod p) 
from a given Weierstrass equation. In §10 we define the conductor of an 
elliptic curve and give some of its properties. Finally, in §11, we state 
and mostly verify an important formula of Ogg which gives a relation be- 
tween the conductor, the minimal discriminant, and the special fiber of the 
minimal proper regular model of an elliptic curve. 

In order to simplify the discussion in this chapter, we will make the 
following convention: 


All Dedekind domains and all discrete 
valuation rings have perfect residue fields. 


Notice this includes Dedekind domains and discrete valuation rings whose 
residue fields are finite, which is the case that will mainly interest us. 


81. Group Varieties 
A group variety is an algebraic variety that is also a group. In slightly 


fancier language, a group variety is a group in the category of algebraic 
varieties. This means that the group law is given by algebraic functions. 
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Definition. A group variety (or algebraic group) is an algebraic variety G 
and two morphisms 


uw: GxG—-G and i:G—-G 
satisfying the following group axioms: 


(i) There is a point O € G such that «(P,O) = u(O, P) = P 
for all PEG. 
(ii) u(P,i(P)) = w(i(P), P) = O for all PEG. 
(iii) w(P, u(Q, R)) = u(u(P, Q), R) for all P,Q, REG. 
G is called a commutative group variety if it further satisfies 


(iv) “CP, Q) = w(Q, P) for all P,Q EG. 
The group variety G is defined over K if G is defined over K, the mor- 
phisms yw and i are defined over kK’, and O € G(K). 


Example 1.1.1. An elliptic curve E/K is a commutative group variety 
defined over K. This follows from [AEC, JII.2.2] and [AEC, JI1.3.6]. 


Example 1.1.2. The additive group Ga and the multiplicative group Gm 
are the commutative group varieties 
G, ZA! and Gm = {zr € Al: x £0}. 
The group laws on G, and Gy are defined by the formulas 
be: Ga xX Ga — Ga and Le: Gm X Gn — Ga 
(t,y) -oety (x,y) > ay. 
The additive group is clearly an affine variety. The multiplicative group is 
also an affine variety, since there is an isomorphism 
Gm — {(x,y) € A? : ey =1}, xr (x,1/2). 
Example 1.1.3. The general linear group GL,, is defined by 
Ti1 7" Lin 
2 
GL, =(M= Dota, : EA” : det(M) £0 
In1 tt Enn 


It is a group variety with group law given by matrix multiplication. Note 
that the inverse map i(M) = M7? is a morphism on GL», since the func- 
tion 1/det(M) is a regular function on GL,,. Just as with the multiplica- 
tive group, we observe that GL,, is an affine variety, since it is the com- 
plement of a hypersurface in A”. (In general, the complement of f = 0 
in A” is affine, since it is isomorphic to {(x,y) € A™*! : yf(x) =1}.) We 
have GL; = Gp, and GL,, is non-commutative for n > 2. 
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Definition. A homomorphism of group varieties 6 : G — H is a mor- 
phism of varieties that is also a homomorphism of groups; that is, ¢ is a 
morphism, ¢(Og) = Ou, and 


¢(ue(P,Q)) = nu ($(P),9(Q)) for all P,Q EG. 


Example 1.2.1. An isogeny E; — E> of elliptic curves is a homomor- 
phism of group varieties. This follows from [AEC, III.4.8], which says that 
any morphism ¢ : E, — Ep» satisfying ¢(Oz,) = Oz, is automatically a 
homomorphism. 


Example 1.2.2. The determinant map defines a homomorphism of group 


varieties 
det : GL, — Gy. 


The kernel of the determinant map is another affine group variety called 
the special linear group, 


SL, = {M €GL, : det(M) = 1}. 


In general, an algebraic subgroup of GL, is called a linear group. It turns 
out that every connected affine group variety is a linear group (Water- 
house [1, §3.4]). For some other examples of linear groups, see exercises 4.1 
and 4.2. 


Proposition 1.3. Let G be a group variety defined over a field K. Then 
the set of K-rational points G(K) is a subgroup of G. 


PROOF. The identity element O of G is in G(K) by definition. Further, the 
morphisms up: G x G— G andi:G-—>G are defined over K, so G(K) is 
closed under the group operations. Hence G(K/) is a subgroup of G. Oo 


Example 1.4.1. Let E/K be an elliptic curve. Then E(k) is the group 
of K-rational points of E. If K is a number field, then E(K) is a finitely 
generated group [AEC, VIII.6.7]. 


Example 1.4.2. For any field K we have Ga(K) = K and G,,(K) = K*. 
Similarly, GL, (K) is the group of n x n invertible matrices with coefficients 
in K. 


Proposition 1.5. Let G be a group variety. 

(a) G is a non-singular variety. 

(b) Every connected component of G is irreducible. 

(c) The connected component of G which contains the identity element is 
a normal subgroup of G of finite index. 


Definition. Let G be a group variety. The connnected component of G 
containing the identity element is denoted by G® and is called the identity 
component of G. The quotient group G/G® called the group of components 
of G. 
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Proor (of Proposition 1.5). (a) There is a Zariski open subset U C G 
which is non-singular (Hartshorne [1, 1.5.3]). For any P € G, let tp: G > 
G be the translation-by-P map, Tp(Q) = u(P,Q). Note that rp is an 
isomorphism from G to itself. Now G is covered by the non-singular open 
sets Tp(U), P € G, so every point of G is non-singular. 

(b) Suppose that G has a connected component that consists of more 
than one irreducible component. Then that connected component would 
contain distinct irreducible components that have a point in common, and 
the common point would be singular. This contradicts (a). Hence every 
connected component of G is irreducible. 

(c) A variety has only finitely many connected components, since it actu- 
ally has only finitely many irreducible components (Hartshorne [1, I.1.6]). 
We label the connected components of Gas G°,G!,...,G", where G® is the 
connected component of G containing the identity element. Let P € G®. 
The translation-by-P map Tp permutes the connected components of G, 
so Tp(G°) = G! for some j. But 


P = Tp(O) € tp(G°) = G, 


so the connected components G° and G/ have the common point P. Hence 
G°® =G). This means that w(P,Q) = Tp(Q) € G® for all P,Q € G®. Sim- 
ilarly, O € G° Ni(G®), so i(G°) = G®. This proves that G®° is a subgroup 
of G. 

Next, fix a point @ € G and consider the conjugation-by-Q map 


:G—>G, — OP) =pH(i(Q), u(P.Q)). 


@ is an automorphism of G, so it permutes the components of G. Fur- 
ther, 6(O) = O, so as above we conclude that ¢(G°) = G°. Therefore G° 
is a normal subgroup of G. 

Finally, for each 0 < 7 <n we fix a point P; € G/, Then the maps 


Oy 2G == G, O(P) =p(P, (2); 


permute the components of G and satisfy ¢;(P;) = O, from which we 
conclude that $;(G’) = G°. Hence Po,...,P, includes a complete set of 
coset representatives for G/G®°, so G® has finite index in G. Oo 


We have seen above (1.1.1, 1.1.2) that the additive group, the multi- 
plicative group, and elliptic curves are group varieties. We will now prove 
that these are the only connected group varieties of dimension one. 


Theorem 1.6. Let G be a connected group variety of dimension one de- 
fined over an algebraically closed field. Then either G = Gz, G & Gy, or G 
is an elliptic curve. (For non-algebraically closed fields, see exercise 4.13.) 


Before beginning the proof, we prove a lemma that gives conditions 
under which a curve has only finitely many automorphisms. 
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Lemma 1.7. Let C be a non-singular projective curve of genus g, let S Cc 
C be a finite set of points, and suppose that S satisfies one of the following 
conditions: 


Gi) #S >3ifg=0. (ii) #S>1lifg=1. (iii) S is arbitrary if g > 2. 


Then 
def 


Aut(C; S) {@ € Aut(C) : o(S) c S} 


is a finite set. 


Proor. Suppose first that g = 0, so we can take C = P!. Fix three 
distinct points P,, P2,P3; € S. Every automorphism of P! is given by a 
linear fractional transformation (Hartshorne [1, II.7.1.1]) 


o([x, y)) = [ax + by, cx + dy]. 


An automorphism ¢ will thus be determined by the images of P;, P2, P3, 
which proves that the map (of sets) 


Aut(C;S) — S?, br— (0(P1), (P2), 6( Ps) 


is injective. But S is finite by assumption, so Aut(C; S$) is finite. 

Next suppose that g = 1. We make C into an elliptic curve by taking 
the origin O to be a point in S. Then every isomorphism C — C is a 
translation followed by an isomorphism fixing O [AEC, 1.4.7]. But there 
are only finitely many isomorphisms C — C that fix O [AEC, III.10.1], so 
the map (of sets) 


Aut(C;S) +C, $+ (0), 


is finite-to-one. Since ¢(O) € S and S is finite, this proves that Aut(C; 5) 
is finite. 

Finally, we recall a theorem of Hurwitz which says that if a curve 
has genus g > 2, then it has at most 84(g — 1) automorphisms. (See 
Hartshorne [1, exercises [V.2.5, IV.5.2, V.1.11].) This completes the proof 
of Lemma 1.7. oO 


PROOF (of Theorem 1.6). We know that the variety G is non-singular, irre- 
ducible (1.5), and has dimension one, so we can embed it as a Zariski open 
subset of a non-singular projective curve, say G C C (Hartshorne [1, I §6]). 
Let S=CN\G be the complement of G in C. 

For every point P € G, the translation-by-P map Tp : G — G is an 
automorphism of G as a variety. Then rp induces a rational map Tp : C > 
C which extends to an isomorphism since C is non-singular [AEC, II.2.1]. 
Clearly, we have tTp(S) C S, since rp(G) = G. In this way, we obtain an 
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inclusion G — Aut(C; S$) given by P+ rp, where Aut(C; S) is as in (1.7). 
But G is a variety of dimension one, so it has infinitely many points. It 
follows from (1.7) that C has genus less than 2, that S = 0 if C has genus 1, 
and #S < 2 if C has genus 0. There are thus four cases to consider. 

Suppose first that C has genus 0 and S = 9. Then G = C = P', and 
the group law on G is a morphism 


p:P! xP! — Pl. 


Such a map has the form (x,y) = [f(x,y), 9(x, y)]. where f and g are 
bihomogeneous polynomials. The fact that w is a morphism means that f 
and g can have no common roots in P! x P!, which implies that 4 must 
look like either 


u(x,y) = [f(x),g(x)] or (x y) = [F(y), 9(y)]- 


But then either 4(O, P) or (P,O) is constant, so 4 cannot define a group 
law. This proves that it is not possible to have G = P!. 

Next suppose that C’ has genus 0 and that #S = 1. Then we can 
identify C with P! in such a way that the point in S is the point at infinity 
and the identity element of G is the point 0. In other words, we have C = 
P!,G=A!, and 0€ A! is the identity element of G. The group law on G 
is a morphism 

p:Alx A'— Al, 


so is a polynomial map, p(z,y) € K[z,y]. For every fixed value of 2, 
we know that the map y + p(z,y) is an automorphism of A!, so pu(z, y) 
must be linear in y. Similarly for x +> yu(x, y), so u(x, y) is also linear in zx. 
Further, (2,0) = x and (0, y) = y, so we conclude that y has the form 


w(z,y) =cx+yt+cry for some c € K. 


Finally, we observe that if c 4 0, then c~! would not have an inverse, 
since u(—c~!,y) = —c7! is constant. Hence c = 0 and u(z,y) = x+y, 
which proves that G = G,. 

The next case to consider is a curve C of genus 0 and #S = 2. This 
time we identify C with P! so that the two points in S are 0 and oo and 
so that the identity element of G is the point 1. Then the group law 
on G = A! \ {0} is a morphism 


we: (At \ {0}) x (AT \ {0}) — (At X {0}), 
so p is a Laurent polynomial, u(r,y) € K[x,27',y, y+]. As above, the 


map y + u(xz,y) is an automorphism for every fixed x, which means it 
must have the form 


w(a,y) a(x) +0(2)y or pu(x,y) = ax) + B(a)y?. 
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We further know that u(1,y) = y, which rules out the second possibility 
and tells us that a(1) = 0 and (1) = 1. In particular, b(x) 4 0. 
If a(x) # 0, then we can find ana € A* so that a(a) # 0 and f(a) £0. 


Then | 
(of) = 


which contradicts the fact that u(G x G) C G. Therefore a(x) = 0. We 
have now shown that p(z,y) = b(x)y. Reversing the roles of x and y 
and using the fact that (1,1) = 1, we conclude that (2, y) = xy, which 
completes the proof that G = G,,. 

It remains to consider the case that C’ = G is a curve of genus 1. 
The group variety G has an identity element O, and we use this point to 
give (C,O) the structure of an elliptic curve. It remains to show that the 
identification G and Cas curves is also an isomorphism of groups. In other 
words, we need to prove that 


Uc(P,Q)=P+Q and ig(P)=—P, 


where ig : G x G — G is the given group law on G and + is the group law 
on the elliptic curve (C,O). Note that we do not assume, a priori, that G 
is commutative. 

Consider the map 


o:CxC—C, O(P,Q) = uc(P,Q) — P-@Q. 


The point O € C is the identity element for both group laws, so we find 
that ¢(P,O) = O and ¢(O,Q) = O for all P,Q € C. It follows from 
an elementary rigidity lemma (1.8) which we will prove below that ¢ is 
constant. Hence 


o(P,Q) = (0,0) =O, andso ype(P,Q)=P+Q forall P.QEC. 
Finally, we observe that 
P+ig(P) =pc(P,ic(P)) =O, 
which proves that ig(P) = —P. This completes the proof of (1.6), subject 


to our proving the following lemma. im) 


Lemma 1.8. (Rigidity Lemma) Let C),C2,C3 be irreducible projective 
curves, and let 


@:C, x Co — Cs 


be a morphism. Suppose that there are points P, € C, and P, € C2 with 
the property that each of ¢(P, x C2) and $(C) x P2) consists of a single 
point. Then @ is a constant map. 


Proor. We are given that @(P; x C2) consists of a single point, say 


o(P, x C2) = R. 
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Choose a point R’ € C3, R’ # R, and consider the set 

U; => {Qe Cy : R' ¢ d(Q x C2)}. 
Notice that the complement of U; in C; is the set 

Cy UL, = proj, (@~'(R’)), 

where proj, : Ci x C2 — C) is projection onto the first factor. The pro- 
jection map sends closed sets to closed sets. This follows from the fact 
that C2 is projective, hence proper (Hartshorne [1, II.4.9]), and the defi- 
nition of properness implies that any projection V x C2 — V is a closed 
morphism. 

Now the set ¢~'(R’) is closed, so the same is true of proj, (¢~'(R’)), 
which shows that U, is an open subset of C). Further, it is clear that 
that P, € U;, so U, is non-empty. For any Q € Uj, we consider the 
morphism 

C2 ——7 C3, Sr o(Q, 8S). 
The fact that Q € U, tells us that R’ is not in the image of this map. 
In other words, this map is not surjective, so it follows from [AEC, IJ.2.3] 
that it is constant. In other words, if Q € U,, then ¢(Q, S) is independent 
of S € Cy. Equivalently, ¢(Q x C2) consists of a single point. 

We now repeat the above argument using the fact that @(C) = P2) 
consists of one point. Doing so yields a non-empty open set U2 C C2 with 
the property that for all S € Us, the set d(C) x S) consists of a single point. 

Combining these two facts, we find that ¢(U; x U2) consists of one 
point. But U; x U2 is Zariski dense in C, x C2, and a morphism is deter- 
mined by its values on a Zariski dense set. Therefore ¢ is constant. 

oO 
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In this section we are going to review some basic notions about schemes, 
especially schemes over a fixed base scheme. We assume that the reader 
has some familiarity with this material, as covered for example in Hart- 
shorne [1, IT §§2,3] or Eisenbud-Harris [1]. 
Definition. Let S bea fixed scheme. An S-scheme is a scheme X equipped 
with a morphism X — S. A morphism of S-schemes (or S-morphism) is a 
morphism X — Y so that the diagram 

xX 


Y 
NS we 
S 


is commutative. If S = Spec(R), we will often refer to R-schemes and R- 
morphisms instead of Spec(R)-schemes and Spec(R)-morphisms. 
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Intuitively, an S-scheme X — S can be viewed as an algebraic family 
of schemes, namely the family of fibers X, parametrized by the points 
s €S. We have seen an example of this in Chapter III, where the elliptic 
surface € — C is a C-scheme whose fibers €; form a family of elliptic 
curves. 

Two other important examples are provided by affine and projective 
space over a ring R. These are defined to be 


Ab = Spec R[z1,...,2n] and PR = Proj R[zo,..-,@n]. 


See Hartshorne [1, I.2.5.1] for details. 

In Chapter III we studied the group of sections C — € to the elliptic 
surface. Similarly, we can look at the set of sections S — X of an S-scheme. 
These are precisely the S-morphisms from S to X. More generally, for 
any S-scheme T’, we can consider the set of S-morphisms from T to X. 


Definition. Let X and T be S-schemes. The set of T-valued points of X 
is the set 


X(T) = Homs(T, X) = {S-morphisms T — X}. 


If T = S, we will sometimes call X(S) the set of sections of the S-scheme X. 
Similarly, if S = T = Spec(R), we will refer to the R-valued points of X 
and write X(R). 


Example 2.1.1. Let K be a field, let S = Spec(A’), and let X/K be an 
affine scheme, say given by equations 


ffs tS fee With fipes ote Se Blade 
Then 


X(S) = {K-morphisms Spec(k) + X} 

Klay,...,2n] = 
(fi,---,fr) x} 

={PEK*: fi(P)=---= f,(P) =O}. 


& {K -algebra homomorphisms 


Thus X(S') agrees with our intuition of what X(k) should be. More gen- 
erally, if R is any ring and X C Aj is an affine scheme given by equations 
fi =-:- =f, =O0with f; © R{x1,...,2,], then X(R) is naturally identified 
with the set of n-tuples (71,...,2@,) € R” satisfying the equations. 


Remark 2.1.2. Each S-scheme X defines a functor Fx on the category 
of S-schemes by assigning to an S-scheme 7’ the set of T-valued points 
of X. Thus 


Fy : (S-schemes) — (Sets), Tr X(T). 
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If ¢: T — T’ is an S-morphism of S-schemes, the associated map F'x(@) 
is given by composition, 


Fx(@): Fx(T’) — Fx(T), o> a0 ¢. 


Notice that Fy is a contravariant (i.e., arrow reversing) functor. It is 
a basic categorical fact (called Yoneda’s lemma, see Eisenbud-Harris [1, 
Lemma IV.1]) that the functor Fy determines the scheme X. Similarly, 
morphisms of functors Fy — Fy correspond bijectively with S-morphisms 
X — Y. We will not make use of this functorial approach, but the reader 
should be aware that it is a convenient language which is in common usage. 


Next we describe one of the most important constructions of algebraic 
geometry. 


Definition. Let X and Y be S-schemes. The fiber product of X and Y 
over S is an S-scheme, denoted X xg Y, together with projection mor- 
phisms 

py: X Xs ¥ HX and p2:X xg Y —Y 


over S' with the following universal property: 


Let Z be an S-scheme, and let f: Z — X andg:Z -— Y 
be S-morphisms. Then there exists a unique S-mor- 
phism Z — X xs Y so that the following diagram com- 
mutes: 


f g 
a i 
Mo es. we 2s “Ye 


The fiber product exists and is unique up to unique isomorphism; see Hart- 
shorne [1, IJ.3.3] or Eisenbud-Harris [1, I.C.i, [V.B]). If S = Spec R, we will 
often write X xpY. 


The fiber product is the smallest scheme that fits into the commutative 
diagram 


Xx VY Bhs X 


ee 


In some sense, X xg Y should “look like” the set of ordered pairs (x, y) 
having the property that x and y have the same image in S. This is 
literally true in the category of sets, but care must be taken when applying 
this intuition in the category of schemes. In fact, X xs Y will generally 
be quite large, even when S consists of a single point; see for example 
Hartshorne [1, II, exercise 3.1]. 
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Example 2.2.1. Let s € S, and let Y = {s} — S be the subscheme of S$ 
consisting of the point s. More precisely, if we write k(s) = Os5./Ms,s for 
the residue field of the local ring at s, then Y is the scheme Spec k(s). The 
fiber of X over s is defined to be the scheme 
X, & X xs {s}. 

It is a scheme over k(s). In this case the underlying topological space 
of X xg {s} actually equals the set of points x € X such that the image 
of x in S is s, see Hartshorne [1, II exercise 3.10]. Thus at the level of 
points, this definition of the fiber X, agrees with our intuition of what the 
fiber should be. 


Example 2.2.2. Let R be a ring, let p be a maximal ideal of R, and let X 
be an R-scheme. Then the fiber 


Xp =X XRP 


is the reduction of X modulo p. It is a scheme over the residue field R/p. 
This agrees with our intuition, since an (affine) scheme X over R is defined 
by a system of polynomial equations with coefficients in R, and Xy is the 
scheme defined by reducing the coefficients of the polynomials modulo p. 


Example 2.2.3. Let R be an integral domain, and let 7 = (0) € SpecR 
be the generic point of Spec R. If X is an R-scheme, then the fiber 

Xy =X XR n 
is called the generic fiber of X. It is a scheme over the fraction field K 
of R. In particular, if R is a discrete valuation ring with maximal ideal p, 
then X has two fibers, its generic fiber X,/K and its special (or closed) 
fiber Xp/k, where k = R/p is the residue field of R. 

For example, suppose that X C P% is given by a single homogeneous 
equation f(x,y, z) = 0 with coefficients in R. Then the generic fiber X, C 
PZ, is the variety defined by the same equation f(a, y, z) = 0, and the 
special fiber X, C P2 is the variety defined by the equation f(z,y, z) = 0, 
where f is obtained by reducing the coefficients of f modulo p. 

Example 2.3. Let 7: X — S be an S-scheme. In the definition of the 
fiber product, if we take Z = X and f and g to be the identity map X — X, 
then we obtain the diagonal morphism 

bx :X — X xs X; 


that is, 6x is the unique map to the fiber product with the property that p;0 
6x and p20 6x are each the identity map on X. 

More generally, let ¢ : X — Y be an S-morphism. Then the graph 
of @ is the unique morphism 


VED one eres 


such that p; o 6g is the identity map on X and pz 06g = ¢. Notice the 
diagonal morphism is the graph of the identity map X — X. 
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Example 2.4. Let 7 : € — C be an elliptic surface, say defined over 
an algebraically closed field k. Then the fiber product € xc €, or more 
precisely, the set of k-valued points on the fiber product, is the set 


(E xo €)(k) = {(P,Q) : P,Q € &(k) and x(P) = x(Q)}. 


Thus (€ xc €)(k) consists of pairs of points which lie on the same fiber. 
In particular, if P and Q lie on a non-singular fiber €;, then we can add 
them using the group law on €;. In this way (most of) the fiber prod- 
uct (€ xc €)(k) becomes a group. 


Example 2.5. Recall that every scheme S admits a unique morphism S —> 
Spec Z (Hartshorne [1, exercise II.2.4]). Affine and projective space over S 
are defined to be 


53 =AzZXz5 ~~ and 5 = Pz Xz. 


Projection onto the second factor makes AZ and P% into S-schemes. Notice 
that if S = Spec R is an affine scheme, then AG ~ AR and PZ = Ph, so 
these definitions are compatible with the definitions of affine and projective 
space over a ring. 


We are now faced with the task of discussing three important proper- 
ties of schemes and morphisms, namely regularity, properness, and smooth- 
ness. The definition of each of these properties is somewhat technical, and 
in truth we will make very little use of the formal definitions of properness 
and smoothness in subsequent sections. On the other hand, the intuitions 
underlying all of these properties are quite easy to understand, especially 
if one works in a “nice” setting. So we are going to begin with an informal 
discussion, including examples and basic material which we will give with- 
out proof. This discussion should suffice for reading the remainder of this 
chapter, except possibly for parts of §6. Then, at the end of this section, 
we will give precise definitions and provide references for further reading. 


Intuitive “Definitions.” A scheme X is regular if it is non-singular, by 
which we mean that every point of X has a tangent space of the correct 
dimension. 

A morphism of schemes X — S is proper if all of its fibers are complete 
and separated. (These are algebraic analogues of compact and Hausdorff.) 
Essentially, this means that the fibers of X — S are not missing any points 
and do not have too many points. We also say that X is a proper S-scheme. 

A morphism of schemes X — S is smooth if all of its fibers are non- 
singular, or, to put it another way, if X is a family of regular schemes. We 
also say that X is a smooth S-scheme. 


In order to define regularity, we recall that the Krull dimension of a 
ring A is the largest integer d such that there is a chain of distinct prime 
ideals of A, 

Mo C My C--- CMe. 
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A local ring A with maximal ideal 9M is called regular if the dimension 
of IN/IN? as an A/Mt-vector space is equal to the Krull dimension of A. (See 
Matsumura [1, Ch. 7] or Atiyah-MacDonald [1, Ch. 11] for basic material on 
regular local rings.) Intuitively, I%/IM? is the cotangent space of Spec(A) 
at the point St, and the regularity of A is an assertion that 92 is a non- 
singular point of Spec(A). For arbitrary schemes one defines dimension 
and regularity in terms of the local rings as follows: 


Definition. The dimension of a point P of a scheme X is the Krull di- 
mension of the local ring Op at P. If every closed point of X has the same 
dimension, we call this the dimension of X. 


Definition. A point P of a scheme X is said to be regular (or non- 
singular) if the local ring Op is a regular local ring. The scheme X is 
regular (or non-singular) if every point of X is regular. In fact, it suffices 
to check that every closed point is regular; see exercise 4.5. 


Example 2.6.1. Let R be a Dedekind domain. Then Spec(R) is a regular 
scheme of dimension one. To see this, note that in a Dedekind domain, 
every non-zero prime ideal is maximal by definition. Hence the longest 
chain of prime ideals is (0) C p, so R has dimension one. Further, each 
localization R, is a discrete valuation ring, so its maximal ideal My is 
principal. It follows that M, /M; has dimension one as an R,/M,-vector 
space, so /t, is regular. 


Example 2.6.2. If R is a regular local ring, or more generally if Spec(R) 
is regular, then both A}% and P% are regular schemes. 


Example 2.6.3. Let R be a discrete valuation ring, let 7 be a uniformizer 
for R, and assume that 2,3 € R*. Let a € R, and define a scheme X C Be 
by the equation 
X:yz=2? +42’. 

Then X is a regular scheme if and only if a 0 (mod m), To see this, one 
first checks that the only possible singular point is the point y = [0,0, 1] 
on the special fiber X, and that this can only occur if a = 0(mod 7). 
Dehomogenizing the equation by setting z = 1, we find that the maximal 
ideal M, of the local ring O, is generated by x, y, and 7, and that these 
quantities are related by the equation 


y? =z? +a. 
If a #0 (mod 7), then a is itself a uniformizer for R. Hence 


néeaR=(y?-x)RC Me, 


so x and y generate M,, /M2, which shows that 0, is regular. Conversely, 
if a = 0 (mod 7”), then M,/M? cannot be generated by fewer than three 
elements, so ©, is not regular. 
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Next we look at proper morphisms, which, recall, are supposed to be 
morphisms X — S whose fibers are separated and complete. For example, 
suppose that we are given a “curve” C,, a point 7 € C, and a commutative 


diagram of morphisms es 
Cyny — XxX 


| 


Co ‘gla og. 


If X — S is a proper morphism, then the fiber of X over f(y) is separated 
and complete, so there should be a unique way to extend F to all of C. This 
statement is essentially the following valuative criterion for properness. 


Theorem 2.7. (Valuative Criterion of Properness) Let ¢: X — S be 
a morphism of finite type of Noetherian schemes. The map ¢ is proper if 
and only if for every (discrete) valuation ring R with fraction field K and 
every commutative square of morphisms 


Spec(kK) —> X 


| le 
Spec(R) — S, 
there is a unique morphism Spec(R) — X fitting into the diagram. (In 


other words, there is a unique morphism Spec(R) — X so that the compo- 
sition Spec(R) — X — S agrees with the bottom line of the square.) 


Proor. Hartshorne [1, I1.4.7]. See also Hartshorne [1, exercise II.4.11] for 
the assertion that it suffices to consider only discrete valuation rings. Oo 


In order to better understand what the valuative criterion is saying, we 
note that if R is a discrete valuation ring with fraction field A, then Spec(R) 
is a regular one-dimensional scheme (2.6.1), and Spec(A’) is Spec(R) with 
its closed point removed. Thus Spec(/’) looks like a curve with one point 
removed. 

An important collection of proper S-schemes is the set of projective 
schemes over S,, as described in the following result. 


Theorem 2.8. Let S be a Noetherian scheme, and let X C P§ bea 
closed subscheme of projective space over S. Then X is proper over S. In 
particular, P itself is proper over S. 


PROOF. See Hartshorne [1, IT.4.9]. oO 


We continue our informal discussion by looking at smooth morphisms. 
For our purposes, the most important examples of smooth morphisms will 
be schemes which are smooth over a discrete valuation ring or Dedekind 
domain R. In this situation, the condition that X be smooth over FR is 
essentially equivalent to the assertion that all of its fibers are non-singular 
and have the same dimension. 
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Proposition 2.9. Let R bea discrete valuation ring with fraction field K, 
residue field k, and maximal ideal p. Let X be an integral (i.e., reduced 
and irreducible) R-scheme of finite type over R whose generic fiber X,,/K is 
non-empty. Then X is a smooth R-scheme if and only if X,(K) and X,(k) 
contain no singular points. 


PROOF. The scheme X is irreducible, so it has a unique generic point. Our 
assumption that the generic fiber X,, is non-empty shows that the generic 
point of X maps to the generic point of Spec R, so X is flat over Spec R from 
Hartshorne [1, [JI.9.7]. If R contains its residue field k (the so-called func- 
tion field case), the desired result then follows from Hartshorne [1, III.10.2]. 
The general case is Milne [4, I Prop. 3.24], see also Bosch-Liitkebohmert- 
Raynaud [1, §2.4, Prop. 8]. 

O 


There are many theorems in algebraic geometry which say that some 
property of morphisms, such as properness, smoothness, separability, finite- 
ness, etc., is preserved under composition, base extension, and products. 
The only result of this sort that we will need is the following assertion that 
the composition of smooth morphisms is again smooth. 


Proposition 2.10. If@:X — Y andw:Y — Z are smooth morphisms, 
then the composition Wo @: X — Z is a smooth morphism. 


PRooF. See Hartshorne [1, IIf.10.1c] or Altman-Kleiman [1, VII.1.7ii]. 
Oo 


We now look at some examples of regular schemes and proper and 
smooth morphisms. 


Example 2.11.1. Let R be a discrete valuation ring with uniformizer 7. 
We assume that 2 € R*. Let X C P% be the scheme given by the equation 


X27 4 ay? = 27. 
Then X is proper over R from (2.8), since it is a closed subscheme of P%. It 
is also easy to check that the scheme X is irreducible and regular. However, 
the special fiber of X is given by the equation x? = z?, so the special fiber 
is reducible and singular. Hence X is not smooth over R. 


Example 2.11.2. We continue analyzing example (2.6.3), so R is a dis- 
crete valuation ring with uniformizer 7 and 2,3 € R*, and X Cc Pe is the 
scheme defined by the equation 


X:yz=e22+az3 for some a € R. 


We also let K be the fraction field of R, k the residue field of R, and p the 
maximal ideal of R. Notice that X is proper over R by (2.8), since it is a 
closed subscheme of P}. 
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Suppose first that a € R*. Then the special fiber X,/k is a non- 
singular curve, so X is smooth over R from (2.9). 

Next, suppose that a = O0(modp). Then the special fiber X,/k is 
given by the equation y*z = 2°, so the special fiber is singular and X 
is not smooth over R. Let y € Xp C X be the singular point on the 
special fiber, and let X° = X \ ¥ be the scheme obtained by removing ¥ 
from X. This makes the special fiber x? /k non-singular, so X° is smooth 
over R. However, removing the point 7 has destroyed the completeness of 
the special fiber, so X° is not proper over R. 

Finally, we observe that if a is a uniformizer for R, then (2.6.3) says 
that X is regular. This is true despite the fact that its special fiber X, 
is singular and so X is not smooth over R. We will prove later (4.4) that 
since X is regular, every R-valued point of X lies in the smooth part X°. 
In other words, the natural inclusion X°(R) C X(R) is an equality, so in 
this situation X° retains a sort of properness property over R. 


This last example (2.11.2) illustrates an important general phenome- 
non. Let X be a (nice) scheme which is proper over a discrete valuation 
ring R and which has a smooth generic fiber. Then X need not be smooth 
over R, since its special fiber Xp may have singularities. We can create 
a smooth R-scheme X° by removing from X the singular points on its 
special fiber, but then X° will not be proper over R. Thus the attributes 
of properness and smoothness are somewhat antithetical to one another. 

However, if the original scheme X is regular, then it turns out that 
every R-valued point in X(R) actually lies in X°(R). So for regular 
schemes, X° still behaves to some extent as if it were proper over R. We 
will prove this later (4.4) when X has relative dimension one over R. The 
general case we leave as an exercise. 

We are now ready to define properness and smoothness, but we want to 
stress that the most important thing is for the reader to understand the un- 
derlying intuitions and the examples described above. For further reading 
on this material, see Hartshorne [1, III §§9,10], Altman-Kleiman [{1, V,VJj, 
and Bosch-Liitkebohmert-Raynaud [1, 2.1-2.4]. 


Definition. Let ¢@ : X — S be a morphism of finite type. The map @ 
is separated if the diagonal morphism 6x : X — X xg X (2.3) is a closed 
immersion. The map ¢ is universally closed if for any base extension S’ > 
S, the map X xg S’ — S’ sends closed set to closed sets. The map @ is 
proper if it is separated and universally closed. We also say that X is 
proper over S, or that X is a proper S'-scheme. 


Definition. Let @: X — S be a morphism of finite type, let « € X, and 
let s = (a) € S. The map ¢ is smooth (of relative dimension r) at a 
point x € X if there are affine open neighborhoods 


s€SpecRCS and xeSpecAc xX 
with 
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A= Rith,...,tntr|/(fi.---.fn) for some fi,..., fn € Riti,...,tr+r] 


so that the n xn minors of the Jacobian matrix (Of;/Ot;) generate the unit 
ideal in A. 

We say that ¢ is smooth (or that X is smooth over S) if ¢ is smooth 
at all points of X. A morphism that is smooth of relative dimension zero 
is called an étale morphism. 


Remark 2.12. The Jacobian condition in the definition of smoothness is 
similar to the criterion we used to define non-singular points on varieties 
in [AEC, I §1]. In particular, it is clear that if X — Spec R is a smooth 
morphism and p € Spec FR is a maximal ideal, then the fiber Xp is a non- 
singular variety over the residue field R/p, which is a special case of (2.9). 
There are many other ways to define smoothness. One of the most useful 
is in terms of the sheaf of relative differentials of X/S, see for example 
Hartshorne [1, II §8, II 810), Altman-Kleiman [1, VI,VII], Milne [4, 1§3] 
or Bosch-Liitkebohmert-Raynaud [1, §2.1,2.2]. For a fancy functorial defi- 
nition, see Milne [4, 1.3.22]. 


§3. Group Schemes 


A group scheme over S is an S-scheme G whose fibers form an algebraic 
family of groups, similar to the example described in (2.4). This means we 
should be able to multiply two points provided they lie on the same fiber, 
and the multiplications should fit together to give a group law on the fiber 
product G xs G. More formally, a group scheme over S' is a group in the 
category of S-schemes. This leads to the following definition. Note that we 
must be careful to define everything in terms of maps, rather than in terms 
of points. (An alternative approach is to define a group scheme in terms of 
its associated functor of points; see (2.1.2) or Eisenbud-Harris [1, IV.A.v].) 


Definition. Let S be a scheme. A group scheme over S is an S-scheme 
ma:G-— S and S-morphisms 


09:5 —G, i:G—-+G, u:GxsG—G, 


such that the following diagrams commute: 
(i) (identity element) 
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(ii) (tnverse) 


CGxee 225) Ose sexe = GxeG 
ic he ° 8 r 
G ats IG Se Cc G ee Bs ey G 


(Here G 22; G xg G is the diagonal map (2.3).) 
(iii) (associativity) 


GxsGxsG a GxsG 


[xe |e 


GxsG a5, G 


Example 3.1.1. Let G be a group variety defined over a field K as dis- 
cussed in §1. Then G is a group scheme over the one point scheme S = 
Spec(). This is clear from the definitions. Note that the identity mor- 
phism op : S — G sends the one point in S to the identity element of G. 


Example 3.1.2. The additive group scheme G, over Z is the scheme G, = 
Spec Z[T]. The group law on G, is given by 


Ga Xz Ga —- Ga 
| 
(Spec Z[T;]) xz (Spec Z[Tp]) 
| 
Spec (Z[Ti] Qz Z{T2]) 


| 
Spec Z[T), T2] — SpecZ[T], 


where the morphism Spec Z[T,, 72] — Spec Z[T] is induced by the ring 
homomorphism 


Z|T| SS Z[T), To], T b> Ti + To». 


For any ring R, we have G(R) = R with group law given by addition 
on R. The additive group scheme Ga;g over an arbitrary scheme S is the 
group scheme G, xz S obtained by base extension. In particular, Gayp = 
Spec R[T]. 
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Example 3.1.3. The multiplicative group scheme G,, over Z is the scheme 
Gm = SpecZ[T,T—1]. The group law on G,, is given by 


Gn XZ Gy v= Gn 


I 
(Spec Z[T1,T; *]) xz (Spec Z[T2, T> ']) 


I 
SpecZ(fs Fy tot | — SpecZ(T,T~1], 


where the morphism Spec Z[T), T,;',T2, Ty '] + Spec Z[T,T~'] is induced 
by the ring homomorphism 


Spec Z[T,T~*] — Z[T,,T,',T2,Ts'],  T—- TyTo. 


For any ring R, we have G,,(R) = R* with group law given by multi- 
plication on R*. The multiplicative group scheme Gms over an arbitrary 
scheme S' is the group scheme G,, xz S obtained by base extension. In 
particular, Gu;p = Spec R[T,T~"). 


Example 3.1.4. Let R be a discrete valuation ring with maximal ideal p 
and fraction field K, and let E/K be an elliptic curve with good reduction 
at p. Fix a minimal Weierstrass equation for FE, 


E: yz +ayry+azy = x? + agu? + age + ag. 


The coefficients of this equation are in R, so we can use the equation to 
define an R-scheme € C lige (Of course, we need to homogenize the equa- 
tion first.) The fact that & has good reduction implies that the scheme € 
is smooth over R, since good reduction is equivalent to the fact that the 
special fiber €, of E — Spec R is a smooth elliptic curve over the residue 
field R/p. 

The addition law on £ is given by rational functions with coefficients 
in R, so it induces a rational map 


BwiExXRpE—E. 


We know from [AEC, III.3.6] that the addition law E x E — E on the 
generic fiber is a morphism. We will later give two proofs that w itself is 
an R-morphism. The first proof (5.3) uses explicit equations and is similar 
to the argument in [AEC, III.3.6, III.3.6.1]. The second proof (6.1) uses 
fancier machinery to prove a much stronger result. 


The next proposition shows that the set of T-valued points of a group 
scheme form a group. 
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Proposition 3.2. Let G be a group scheme over S, let T be an arbi- 
trary S-scheme, and let G(T) be the set of T-valued points of G, which 
recall is the set of S-morphisms T — G. For any two elements ¢, 1 € G(T), 
define a new element ¢ * = € G(T) by the commutativity of the diagram 


TxsT ad GxgG 


Jos |» 


ps: ES se 
where 67 is the diagonal map (2.3). In other words, 
oxy =po(dxy)obr € G(T). 


This operation gives G(T) the structure of a group. The identity element 
is 09 om7, where np: T — S is the map making T into an S-scheme. The 
inverse of d isio @. 

More precisely, the association T — G(T) is a contravariant functor 
from the category of S-schemes to the category of groups. 


Proor. All of this follows from the definitions and elementary diagram 
chases. For example, to verify that op o7r is the identity element of G(T), 
we observe that the following diagram is commutative: 


1xrr 
G xs T — G xs S 
Jou | 1x20 
ox (aoorr) 


TxsT SS GxsG 


or [« 
p*(aooTr) 
-P — G 


But the definition of oo tells us that the map ~o(1 xo9):GxgS—-G 
down the right-hand side of this diagram is projection onto the first factor. 
Hence tracing around the boundary of the diagram yields 


b* (a0) =pio(1x mr)o (dx ljoby =p. 0(ox lhcb =. 


We will leave it to the reader to perform the similar computations 
needed to check that 7 o @ is the inverse of @ and that the associative law 
holds, which completes the proof that G(T) is a group. 

Finally, the functoriality statement means that if f :T’ — T is an S- 
morphism of S-schemes, then the map 


GT) — GT), gr > oof, 
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is a homomorphism of groups. It is clear that the identity element is 
mapped to the identity element, so we must verify that 


(pxb)of=(dof)*(Wof) forall d,f €G(Z). 
The definition of @ * W says that the right-hand square of the following 
diagram is commutative, and the left-hand square is clearly commutative: 


7! gt VIE te OE RG 


Iv fe 
- £ a = we 

The map (¢ x =) o(f x f) along the top row of this diagram is equal to 

(po f) x (Wo f), so by definition the map along the bottom row equals 

(p°o f) x (wo f). This is the desired result, which completes the proof of 

Proposition 3.2. oO 


Remark 3.3. In our study of elliptic curves and group varieties, the 
translation-by-P maps provided an important tool. A group scheme G/S 
is not a group, so we cannot translate G by a point of the scheme G. In- 
stead, G is a family of groups parametrized by the points of S. So in order 
to translate G, we need to start with a family of points on G parametrized 
by S. Then we can translate each group in the family by the appropriate 
point. We formalize this idea in the following manner. 

Let G/S be a group scheme, and let ¢ € G(S) be an S-valued point 
of G. Then the (right) translation-by-o morphism is the S-morphism 
Tz :G — G defined by the composition 


pi ae SES (Cea Eee! hae 


To understand 7, further, we note that the S-valued point o is a map 
ao:S—G. In particular, for each point s € S, we get a point o(s) on 
the fiber G,, where G, is a group variety over the residue field at s. The 
restriction of tT, to the fiber G,, is precisely translation by the point o(s) € 
G,. Thus 7, can be viewed as a family of translations of the fibers of G. 


Remark 3.4. Another important tool in our study of elliptic curves was 
the collection of multiplication-by-m maps. These maps can be defined 
inductively on every group scheme in the following way. Let 7: G— S be 
a group group scheme over S with identity element a9 : S — G, inverse map 
i: G— G, and group multiplication 1: GxgG— G. Also let idg:G—-G 
be the identity map on G. For each integer m, the multiplication-by-m map 
on G is the morphism 
[m]:G—-G 
defined inductively by the rules 


[=ide, (mt 1J=po([m]x[t]), fr —1] = po ((m] x). 
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§4. Arithmetic Surfaces 


Let R be a Dedekind domain. An arithmetic surface over Spec(R) is the 
arithmetic analogue of the fibered surfaces studied in [II §8. Here Spec(R) 
plays the role of the base curve, and an arithmetic surface is an R-scheme 
© — Spec(R) whose fibers are curves. For example, if R is a discrete valu- 
ation ring, then there will be two fibers. The generic fiber will be a curve 
over the fraction field of R and the special fiber will be a curve over the 
residue field of R. Just as in the case of fibered surfaces, an arithmetic 
surface C may be regular (non-singular) even if it has singular fibers. 


Definition. Let R be a Dedekind domain with fraction field K. For exam- 
ple, R could be a discrete valuation ring. Intuitively, an arithmetic surface 
(over R) is a “nice” R-scheme © whose generic fiber is a non-singular con- 
nected projective curve C/K and whose special fibers are unions of curves 
over the appropriate residue fields. Note that the special fibers may be 
reducible or singular or even non-reduced. This intuitive definition will 
suffice for our purposes, but for the technically inclined, we indicate that 
the word “nice” is an abbreviation which means that C is an integral, nor- 
mal, excellent scheme which is flat and of finite type over R.* 


Remark 4.1.1. An arithmetic surface @ is a one-dimensional family of 
one-dimensional varieties, so it is a scheme of dimension two. One might 
instead call C a curve over R, since it has relative dimension one over R 
(i.e., the fibers are one-dimensional). We will frequently be interested in 
arithmetic surfaces which are regular, or proper over R, or smooth over R. 
We recall the intuitions from section 2. An arithmetic surface C is regular if 
it is non-singular as a surface, C is proper over R if its fibers are complete, 
and € is smooth over R if its fibers are non-singular. If C is smooth over R, 
then it is automatically regular, but in general the converse is not true. 


Remark 4.1.2. The definition of an arithmetic surface C ensures that even 
if € is not regular, its set of singular points is a finite set of closed points. 
In other words, an arithmetic surface is regular in codimension one. This 
means that there is a theory of Weil divisors on ©. In particular, for any 
irreducible curve F C € (equivalently, any point F € © of codimension 
one), the local ring Of of @ at F is a discrete valuation ring. We denote 
the corresponding normalized valuation by 


ordr : K(C)* — Z, 


t Integral is equivalent to reduced and irreducible (Hartshorne [1, II.3.1]), 
normal means that all local rings are integrally closed (Hartshorne [1, II ex- 
ercise 3.8]), flat means that the fibers vary “nicely” (Hartshorne [1, III §9]), 
finite type means the extensions of local rings are finitely generated algebras 
(Hartshorne (1, II §3]), and excellent is a somewhat technical condition which 
won't concern us, but see for example Matsumura [1, Ch. 13]. 
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2 8 © © (0) 


The Arithmetic Surface € : y? = x° + 2x? + 6 over Spec(Z) 
Figure 4.1 


where A (C) is the function field of ©. For the basic thoery of Weil divisors, 
principal divisors, and the divisor class group, see Hartshorne [1, II §6]. We 
will continue our discussion of divisors on arithmetic surfaces in section 7. 


Example 4.2.1. The projective line P}, over R is an arithmetic surface 
over R. For any maximal ideal p of R, the fiber over p is P}, the projective 
line over the residue field k = R/p. Notice that P} is both proper and 
smooth over R. 


Example 4.2.2. Let € ¢ P% be the closed subscheme of P2 given by the 
equation 
C:y? =a? + 227 +6. 


The generic fiber of © is an elliptic curve £/Q with discriminant A = 
—2° . 3-97, so for all primes p 4 2,3, 97, the fiber @, is a (non-singular) 
elliptic curve over F,. The fibers over the “bad” primes are 


Co:y®=2*,  Cg:y*=27(e@4+2), Coz: y* = (x + 66)7(@ + 64). 


The arithmetic surface C/Z is illustrated in Figure 4.1. 

The arithmetic surface € is proper over Z, since it is a closed subscheme 
of PZ (2.8). It is clear that @ is not smooth over Z, since it has singular 
fibers. We claim that © is a regular scheme. To see this, it suffices to 
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check that C is regular at the singular points on the fibers. We will check 
that the point P € C corresponding to the cusp x = y = 2 = 0 on the 
fiber Cg is non-singular. The maximal ideal Mp of the local ring Op at P 
is generated by z, y, and 2, and the residue field at P is Op/Mp = Fo. By 
definition, C is regular at P if 


dimg, Mp/M?, = 2. 


This dimension cannot be less than two, so we must show that Mp/M? 
can be generated by two of x, y, 2. Using the equation for C, we see that 


223 '@? 22° 02°) ME; 


so x and y are generators. This proves that C is regular at P. We will 
leave for the reader the analogous calculations at z = y = 3 = O and 
x +66 = y = 97 = 0 (exercise 4.14). 

The scheme @ is thus regular and proper over Z. If we discard the 
three singular points on the three singular fibers, we obtain an open sub- 
scheme ©° Cc € with the property that @° is smooth over Z. Of course, C° 
will not be proper over Z, since some of its fibers are missing points. 


Example 4.2.3. Let @ C P3 be the closed subscheme of P? given by the 
equation 

C: yr? =a3 + 22744, 
The singular fibers of C are C2, C5, and Cz. The scheme € is not regular, 


since one easily checks that the point x = y = 2 = 0 is a singular point 
of C. 


Let 7 : C — Spec(R) be an arithmetic surface and let p € Spec(R) be 
a point with residue field k, = R/p. The fiber 


Cy = ¢ XRP= e X Spec(R) Spec(kp) 


is a curve, but it may be reducible or singular or even non-reduced. More 
precisely, we can write the fiber as a union 


r 
C, = > ni Fy 
i=1 


for certain irreducible curves F),...,F;-/kp and multiplicities n1,...,n- > 1 
in the following manner. Fix a uniformizer u € R for p, that is, ordy(u) = 1. 
Then 7*(u) = uo7z is a rational function on @, and the fiber of € over p is 
given by 
Cp = S> ord p(n*u)F. 
FCr-1(p) 
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F3 F) 


The Special Fiber Cs : (y? — 2? — 3x7)(y — 2)?(2y — x — 3) =0 
Figure 4.2 


Here the sum is over the irreducible components of the fiber over p, and 
ordr is the normalized valuation on K(C) corresponding to F (4.1.2). 

There are several ways in which a point x € Cy can be a singular point 
of the fiber Cp. It may lie on a component F with multiplicity n > 2, it 
may be a point where two or more components intersect, or it may be a 
singular point of a particular component. The following example illustrates 
these ideas. 


Example 4.2.4. Consider the arithmetic surface C C A? defined by the 
equation 


C: 2y° — (x + 1)y* ~ (223 + 2? + 2)y? 
+ (24 — 23 4+ 327 + & — 2)y? + (24 4 323)y — ct — 3 +o? = 5. 


We are going to look at the special fiber Cs of C over the point (5) € Spec Z. 
This special fiber is the curve in Ag. defined by reducing the equation of € 
modulo 5, so after some algebra we find 


Cs : (y? — 2? — 3a?)\(y ~ 2)?(2y — x — 3) = 0. 
Thus Cs consists of three irreducible components, which we label as 
Fi: y? =2° 4327, Fo: y=2, Fa: 2y=x4+3. 


We have illustrated the special fiber Cs in Figure 4.2. Such illus- 
trations can be very useful for visualizing components, multiplicities, and 
intersections, as long as one keeps in mind that one is looking at a drawing 
in R? which purports to represent a curve in characteristic p! In particu- 
lar, there may be intersection points which are “hidden” because R is not 
algebraically closed. 

The component F of C5 appears with multiplicity 2, and each of the 
other components has multiplicity 1, so as a scheme the special fiber has 
the form 

C3 = Fi + 2Fo + Fs. 
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In particular, the scheme @; is neither irreducible nor reduced. Every point 
on F> is a singular point of Cs, since F2 itself appears with multiplicity 
greater than 1. The other singular points on the special fiber are the 
node (0,0) on Fi, and the points where the various F;’s intersect, such as 
the point (3,2) on F, M Fy and the points (2,0) and (3,3) on FUN F3. 
(Remember that the fiber Cs lives in characteristic 5.) 


The next proposition says that if an arithmetic surface C is regular 
and if a point x € C, on its special fiber lies in the image of an R-valued 
point P € €(R) (ie., if ¢ = P(p)), then x is automatically a non-singular 
point of Cy. 


Proposition 4.3. Let 7: @C — Spec(R) be a regular arithmetic surface 
over a Dedekind domain R, and let p € Spec(R). 
(a) Let x € Cy C © be a closed point on the fiber of © over p. Then 


C, is non-singular at « <=> n*(p) ¢ MG... 


Here n* is the natural map 7* : R — Oe, induced by 7, and Me. is the 
maximal ideal of the local ring Oe,, of © at x. 
(b) Let P € C(R). Then Cy is non-singular at P(p). 


PROOF. To ease notation, we will write 
C=C, = C xp (R/p) 


for the fiber of C over p, and we will let P = 7*(p)Oe 2. Notice that Pc 
Me,+, since x lies on the special fiber over p. 
(a) We first assume that $B ¢ M2 _, and prove that z is a non-singular point 
of C. We are given that C is regular, so, by definition, Oe, is a regular local 
ring of dimension two. This means that we can find elements f,, fo € Mex 
so that 

Mew = fiOe s+ fo0ex + Me. 


If we write p= tR, then m*(t) € BC Mex, so 
m*(t)=aifitazfz (mod Mé,,) for some a1, a2 € Oe». 


Our assumption is that 7*(t)Oe. =P ¢ ME which means that at least 
one of a; and ag is not in Me,s, and hence at least one of them is a unit 
in Oe. Switching f; and fo if necessary, this means that 


Me,2 = 7" (t)Oc2+ f2o00e,c + Me. = Bt fo0ec + ME... 


The fiber @ (as a scheme, which includes multiplicities associated to 
non-reduced components) is C = © xr (R/p), so its local ring at x is ob- 
tained from the local ring of © by reduction modulo p. In other words, 


Oe, => Oe.2/8 and Me» = Me.+/f. 
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Therefore 
Mex = (PB + foOex + Mé.2) /P= S208. + Me: 


Hence Me ,/M3 , is generated by the single element fo, which shows 
that Og, isa regular local ring of dimension one, and so z is a non-singular 
point of é. 

This proves the implication that we will need for part (b). We will 
leave the proof of the opposite implication as an exercise for the reader 
(exercise 4.17). 

(b) We assume that 7*(p) C M2_, and derive a contradiction. Using the 
fact that 7 o P is the identity map on Spec(R), we compute 


p = (10 P)*(p) = P* on*(p) C P*(M2.,) = (P*Mea)? =p. 


The last equality follows from the fact that P : Spec(R) — € is a mor- 
phism of schemes, so by definition (Hartshorne [1, II §2]) the induced map 
P*: Oe — Ry is a local homomorphism of local rings. This means in 
particular that P*Me,. = p. 

But p is a maximal ideal of the Dedekind domain R, so the inclu- 
sion p C p? is impossible. Therefore *(p) ¢ M,. Applying (a), we 
conclude that x is a non-singular point of the fiber C,, which concludes the 
proof of (b). Oo 


The following important corollary says that the smooth part of a 
proper regular arithmetic surface is large enough to contain all of the ra- 
tional points on the generic fiber. For an example which shows that the 
regularity condition is necessary, see (5.4.4) in the next section. 


Corollary 4.4. Let R be a Dedekind domain with fraction field K, 
let C/R be an arithmetic surface, and let C/K be the generic fiber of C. 
(a) If © is proper over R, then 


C(K) = C(R). 
(b) Suppose that the scheme @ is regular, and let C° C @ be the largest 
subscheme of @ such that the map C° — Spec(R) is a smooth morphism. 
Then 

e(R) = ©°(R). 


(c) In particular, if C is regular and proper over R, then 


C(K) = C(R) = €°(R). 
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PrRooF. (a) This is really just a special case of the valuative criterion of 
properness. Any point in €(R) can be specialized to the generic fiber to 
give a point in C(), so there is a natural map C(R) — C(K). This 
map is clearly one-to-one, since two morphisms Spec(R) — © which agree 
generically (i.e., on a dense open set) are the same. Thus C(R) > C(K). 

Let P € C(K) bea point. We are given that € is proper over R, so the 
valuative criterion (2.7) says that there is a morphism op : Spec(R) > € 
making the following diagram commute: 


C= CxRrk = e 


jp 
Spec(K) —  Spec(R). 


This proves that every point in C(K) comes from a point in C(R), so 
C(R) = C(K). 

(b) Proposition 4.3 says that every point in C(R) intersects each fiber 
at a non-singular point of the fiber. But, by definition, C° is the com- 
plement in € of the singular points on the fibers. Therefore the natural 
inclusion €°(R) < @(R) is a bijection. 

(c) This is immediate from (a) and (b). oO 


The previous corollary (4.4) says that if © is a regular arithmetic sur- 
face that is proper over R, then the smooth part C° of € is large enough 
so that all of the K-valued points on the generic fiber extend to R-valued 
points of C°. This raises two questions. First, given a (non-singular projec- 
tive) curve C defined over K’,, does there exist a regular arithmetic surface C 
proper over R whose generic fiber is C/K? Second, assuming such proper 
regular models exist, to what extent is there a minimal such model? The 
following theorem gives the answer to these questions. It is the arithmetic 
analogue of the geometric results described in (III.7.7) and (III.8.4). We 
will discuss the construction of these minimal models further in §7. 


Theorem 4.5. Let R be a Dedekind domain with fraction field K, and 
let C/K be a non-singular projective curve of genus g. 

(a) (Resolution of Singularities for Arithmetic Surfaces, Abhyankar [1,2], 
Lipman [1,2]) There exists a regular arithmetic surface @/R, proper over R, 
whose generic fiber is isomorphic to C/K. We call C/R a proper regular 
model for C/K. 

(b) (Minimal Models Theorem, Lichtenbaum [1], Shafarevich [2]) Assume 
that g > 1. Then there exists a proper regular model C™"/R for C/K 
with the following minimality property: 

Let C/R be any other proper regular model for C/K. Fix an isomor- 
phism from the generic fiber of © to the generic fiber of C™™. Then the 
induced R-birational map 

C --5 emin 
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is an R-isomorphism. We call C™™/R the minimal proper regular model 
for C/K. It is unique up to unique R-isomorphism. 


PROOF. (a) See Abhyankar [1,2] and Lipman [1,2]. There is also a nice 
exposition of Lipman’s proof in Artin [2]. In the case that C has genus 1, 
we will explicitly construct a proper regular model for C/K in §9. 

(b) See Lichtenbaum [1, Thm. 4.4] and Shafarevich [2, lectures 6,7,8]. 
There is a nice summary of the main results with sketches of the proofs in 
Chinburg [1]. See also §7 for a further discussion. Oo 


Just as in (III.8.4.1), the importance of the minimal regular model 
lies in the fact that every automorphism of its generic fiber extends to a 
morphism of the entire scheme. 


Proposition 4.6. Let R be a Dedekind domain with fraction field K, 
and let C/K be a non-singular projective curve of genus g > 1. Let C/R 
be a minimal proper regular model for C/K, and let @° c € be the largest 
subscheme of © which is smooth over R. Then every K-automorphism 
1 :C/K — C/K of the generic fiber of C extends to give R-automorphisms 


7r:@0C—eC and 720° — e?, 


PROoF. The fact that 7 extends to an R-automorphism C — C is exactly 
the definition of minimality given in (4.5b). Next take any point x € @° 
and choose some neighborhood U c €° of x. Then U is smooth over R. 
Further, U is an open subset of @, since C° is open in C. We know that 
7:@— € is an R-isomorphism, so 7(U) is an open neighborhood of 7(z) 
and is smooth over R. Therefore r(x) € ©°, which proves that 7(€°) c @°. 
Applying the same argument to T~! gives r~!(€°) C @°, which completes 
the proof that 7 gives an R-automorphism of C°. Oo 
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Let K be the fraction field of a discrete valuation ring R. The Néron model 
of an elliptic curve E/K is an arithmetic surface €/R whose generic fiber is 
the given elliptic curve. The scheme €/R is characterized by the fact that 
it is large enough so that every point of E gives a point of €, but small 
enough so that the group law on F extends to make € into a group scheme 
over R. Of course, when we talk of “points of €,” we mean more than just 
the points of the underlying scheme. This leads to the following definition. 
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Definition. Let R be a Dedekind domain with fraction field K, and let 
E/K be an elliptic curve. A Néron model for E/K is a (smooth) group 
scheme €/R whose generic fiber is E/K and which satisfies the following 
universal property: 


Let X/R be a smooth R-scheme (i.e., X is 

smooth over R) with generic fiber X/K, and Néron 
let Ox : X/~K — E;K be a rational map de- (starr) 
fined over K. Then there exists a unique R- Property 
morphism ¢r : X/;p — E/p extending ox. 


Remark 5.1.1. A Néron model €/R is a smooth R-scheme. This means 
that for every point p € Spec(R), the fiber €, of € — Spec(R) is a non- 
singular variety defined over the residue field k(p); see (2.9). However, 
as we will soon see, the fiber €, over a closed point p may have several 
components and may not be complete, so in general € will not be proper 
over R. 


Remark 5.1.2. In the Néron mapping property we have only required 
that the map x : X/~K — Ex on the generic fiber be a rational map. It 
turns out that any rational map from a non-singular variety to an elliptic 
curve is a morphism. See (6.2b) below for an even more general statement. 


Remark 5.1.3. The most important instance of the Néron mapping prop- 
erty is the case that X = Spec(R) and X = Spec(K’). Then the set of K- 
maps X/x — E/x is precisely the group of K-rational points E(k), and 
the set of R-morphisms Xp — €/pr is the group of sections €(R). So in 
this situation the Néron mapping property says that the natural inclusion 


€(R) — E(K) 


is a bijection. If R is a complete discrete valuation ring with algebraically 
closed residue field, then one can show that the equality €(R) = E(k) 
suffices to ensure that the group scheme €/R is a Néron model for E/K. 
See exercise 4.30. 


We begin our study of Néron models by proving that they are unique 
and behave well under unramified base extension. 


Proposition 5.2. Let R be a Dedekind domain with fraction field K, 
and let E/K be an elliptic curve. 

(a) Suppose that €,/R and €2/R are Néron models for E/K. Then there 
exists a unique R-isomorphism w : €1/R — €2/R whose restriction to the 
generic fiber is the identity map on E/K. In other words, the Néron model 
of E/K is unique up to unique isomorphism. 

(b) Let K'/K be a finite unramified extension, and let R’ be the integral 
closure of R in K’. Let E/R be a Néron model for E/K. Then Exp R’ isa 
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Néron model for E/K’. (N.B. If K'/K is ramified, this result will generally 
not be true.) 


PRooF. (a) The identity map E/K — E/K is a rational map from the 
generic fiber of €; to the generic fiber of Eg, and €; is smooth over R, 
so the Néron mapping property for €2 says that the identity map extends 
uniquely to an R-morphism y~: €;/R— €2/R. In a similar fashion we 
obtain a unique R-morphism ¢: €2/R— €,/R which is the identity map 
on the generic fiber. But then dow: €;/R — €,/R and the identity map 
€,/R — €,/R are R-morphisms which are the same on the generic fiber, so 
the uniqueness part of the Néron mapping property says that ¢o w equals 
the identity map. This proves that ¢ and w are isomorphisms. 

(b) Let X’/R’ be a smooth R’-scheme with generic fiber X'/K’, and let 
OK: Xe) — E/x bea rational map. The composition 


X' —> Spec(R’) —> Spec(R) 


makes X’ into an R-scheme. Further, our assumptions on K‘’ imply that 
the map 
Spec(R’) —> Spec(R) 


is a smooth morphism. (See exercise 4.19.) Hence the composition is a 
smooth morphism (2.10), so X’ is a smooth R-scheme. 
Now the Néron mapping property for €/R tells us that there is an R- 
morphism 
pe 
whose restriction to the generic fiber is the composition 
Se ES By ere Pp: 


The two R-morphisms dp : X’ — E€ and X’ — Spec(R’) determine an R- 
morphism (and thus an R’-morphism) to the fiber product, 


PR PX xR R. 


Further, the restriction of dp to the generic fiber is x. This gives the 
existence part of the Néron mapping property. We will leave it to the reader 
to prove the uniqueness part, which completes the proof that € xp R’ isa 
Néron model for E'/K’. Oo 


Let R be a discrete valuation ring with fraction field K. We are going 
to use the Weierstrass equation of an elliptic curve E/K directly to con- 
struct an R-group scheme W° C P?, whose generic fiber is E/K. If the clo- 
sure W of W® in P?, is regular, then we will also prove that W°(R) = E(K), 
so W® satisfies the most important instance of the Néron mapping prop- 
erty (5.1.3). In particular, if EF has good reduction, then we will see (6.3) 
that a minimal Weierstrass equation for E/K already defines a Néron 
model. 
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Theorem 5.3. Let R be a discrete valuation ring with fraction field Kk, 
let E/K be an elliptic curve, and choose a Weierstrass equation for E/K 
with coefficients in R, 


EB: y? + ary + asy = £3 + agz? + asx + ag. 


This Weierstrass equation defines a scheme W C P®. Let W® C W be the 
largest subscheme of W which is smooth over R. 

(a) Both W/R and W°/R have generic fiber E/K. 

(b) The natural map W(R) — E(K) is a bijection. If W is regular, then 
the natural map W°(R) — W(R) is also a bijection, so in this case there is 
a natural identification W°(R) = E(K). 

(c) The addition and negation maps on E' extend to R-morphisms 


W? xp W® — Ww? and w? —. we 


which make W® into a group scheme over R. The addition map further 
extends to an R-morphism 


W? xpw— Ww 


giving a group scheme action of W® on W. 


Remark 5.4.1. If E/K has good reduction and if we take a minimal 
Weierstrass equation for E/K, then W itself is smooth over R. So in this 
case (5.3) says that W = W? is a group scheme over R. 


Remark 5.4.2. If E/K has bad reduction, then there is exactly one sin- 
gular point on the reduction E (mod p). In other words, the special fiber Ww 
of W contains exactly one singular point, say y € We W, and then W® is 
obtained by discarding that point, 


W? = W\ {7}. 


Remark 5.4.3. The scheme W/R in (5.3) is proper over R, since it is a 
closed subscheme of P?, (2.8). It follows from the valuative criterion (4.4a) 
that W(R) = E(k). However, in general, the scheme W will not be regular, 
since a singular point on the special fiber will often be a singular point 
of W. So, in general, we cannot use (4.4b) to deduce that W°(R) = E(K). 
Intuitively, if W is singular, then there will be points P € E(K) = W(R) 
which go through the singular point of W. Thus, in general, W® will not 
be large enough to be a Néron model, because W°(R) # E(K), whereas W 
itself will be too large to be a Néron model, because the group law on FE 
will not extend to all of W. 
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Example 5.4.4. We illustrate the previous remark by looking at the curve 
E:y? +2y = 2° 4+ a¢. 


If ag € R*, then W is smooth over R (exercise 4.20(b)), so W is a Néron 
model for E. If v(ag) > 1, then the special fiber 


w : y? + ry = x 
has the singular point (0,0), so 
W® = W\~ {(0,0)}. 


We consider two cases. 

First, if v(ag) = 1 (ie., ag is a uniformizer in R), then W is a regular 
scheme from exercise 4.20(a). It follows from (4.4c) that W°(R) = W(R) = 
E(k). We can also see this directly as follows. If P € W(R) were to 
go through the singular point on the special fiber, then we would have 
P = (x,y) = (0,0) (mod p), which means that x,y € p. But then the 
equation for & would give 


ag =y? +2y— 2° Ep’, 


contradicting the assumption that v(ag) = 1. Hence W°(R) = E(K). 

Second, if v(ag) > 2, then W is not a regular scheme and W® will 
not be a Néron model for E, despite the fact that W° is a group scheme 
with generic fiber E. For example, suppose that ag = a? with u(a) > 1. 
Then the point P = (0,a) € E(K) = W(R) is not in W°(R), since P = 
(0,0) (mod p). 


PROOF (of Theorem 5.3). (a) W is projective over R, since it is the closed 
subscheme of P?, = Proj R[X,Y,Z] defined by the single homogeneous 
equation 


W:Y°Z +a, XYZ 4+ a3YZ? = X? 4agX7Z +04XZ? +062". 


Its generic fiber is the variety in P?, defined by this same equation. Thus 
the generic fiber of W is precisely E/K. 

If W® is not equal to W, then as described above in (5.4.2), W° consists 
of W with one point on the special fiber removed. In particular, W° and W 
have the same generic fiber, so the generic fiber of W° is also E/K. 
(b) The scheme W is a closed subscheme of P%, so it is proper over R (2.8). 
Further, its generic fiber is E/K from (a), so (4.4a) tells us that B(K) = 
W(R). This proves the first part of (b). If in addition W is regular, 
then (4.4b) says that W(R) = W®°(R), which gives the second equal- 
ity E(K) = W°(R). 
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(c) Let 
uu: WxRW--->W and i:W---»>W 


be the rational maps on W induced by the addition and negation laws on 
the generic fiber E/K of W. The fact that the generic fiber is a group 
variety means that ys and i satisfy all of the group axioms on a non-empty 
open subscheme of W, so they will satisfy the group axioms on the largest 
open subscheme on which they are defined. In other words, if we can show 
that is a morphism on W® xg W and that 7 is a morphism on W, then 
the group axioms are automatically true. 

If W is smooth over R, we are going to prove that yz is a morphism on 
all of W xr W. If W is not smooth over R, then as explained above (5.4.2), 
the special fiber W contains a unique singular point y and W° = W\~ {4}. 
In this situation we will show that jz is a morphism except at the single 
point 7 i. 

(7,7) €EWx,WCOW xRW. 


In particular, it is a morphism on W® x p W. 

In order to simplify our calculations, we will assume that the residue 
field & does not have characteristic 2 or 3 (equivalently, 2 and 3 are units 
in R). The general case is similar, but the formulas are considerably longer. 
This assumption allows us to make a change of variables in P?, and put our 
Weierstrass equation in the form 


W:Y?Z = X°?4 AXZ? 4+ BZ. 


In other words, W is the closed subscheme of Pe, defined by this homoge- 
neous equation. 
Let Warr be the affine open subscheme of W defined by 


Wat ={Z #0} CW, 


and let x = X/Z and y = Y/Z be affine coordinates on Wag. The addition 
map p is then given by the usual formula [AEC, ITI.2.3] 


b= [(2 — €1)((y2 — yx)? — (@2 — 21)? (a2 + 41)), 
(yo — 91)? + (@2 — 21)? (a1y1 — 22y2 + 2eay1 — 2wry2), (x2 — ta )x |, 


More precisely, this formula gives the restriction of 4 to Wag XR Wart, 
and on this affine scheme jz will be a morphism except possibly along the 
closed subscheme where its three coordinate functions vanish. Looking 
at the third coordinate and then at the second coordinate, we see that py 
is a morphism on Wag XR Wag except possibly on the closed subscheme 
defined by the equations 


tg- 2, =yo2-yi =0. 
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In other words, 4 is a morphism off of the diagonal. 
To deal with points on the diagonal, we use the relations 


yi=a22+Ar+B and ys =23+Ar2+ B 
which hold identically on Wag x R Wag to rewrite the addition map p as 


= [yi + yo) ((@1 + @2)(i + yo)? + (af + e1xq + 23 + A)?’), 
(x? + 21x + x3 + Ay? 
= (yp y2)? (x1 + x2)? + A(x; +22) + B- yiy2) (yi + y2)°]. 


(See [AEC, III.3.6.1] for a similar calculation.) Just as above, we see that ju 
is a morphism on Wag XR Wag except possibly on the closed subscheme 
defined by the equations 


ytye =a? +ajrg+234+A=0. 


We have now proven that yu is a morphism on Wag XR War except on 
the subscheme defined by the four equations 


$2 - 2, =yo—- I =M t+ yo =e] + a142+25+A=0. 


A little algebra and the fact that 2 € R* shows that this subscheme is 
defined by the equations 


t=, yi. = yo =O, 32? + A=0. 


In particular, it is contained in the diagonal of Wag x R Wag, so if we iden- 
tify Wag with this diagonal, then yu is a morphism except on the subscheme 


y =327+A=0. 


Using the relation y? = 2? + Av + B and the fact that 3 € R*, we see that 
the discriminant 44? + 27B? is contained in the ideal generated by y and 
3a? + A. Hence if W is smooth over R, which implies that its discriminant 
is a unit in R, then uw is a morphism on all of Wag XR War. Similarly, if W 
is not smooth over R, then pu will be a morphism on Wag XR War away 
from the subscheme 


£2 = 21, yi = yo = 0, 3274+ A=0, 4A? + 27B? =0, 


which is precisely the singular point on the special fiber of the diagonal. 
Next let Wi, be the affine open subscheme of W defined by 


at = {Y FO} CW. 
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Notice that W x Rp W is covered by the four affine subschemes 
Wart XR Warr, Wart XR War Wis XR Wart, Wha XR Wars 


since W does not intersect the scheme Y = Z = 0. We have already 
dealt with the restriction of yz to the first set, so it remains to show that u 
is a morphism on each of the last three. We will leave this task for the 
reader (exercise 4.22), since the proof is similar to the argument given 
above. 

Finally, we observe that the negation map 


i:WxrW—W, — [X,Y,Z] — [X,-Y,Z], 


is a morphism on W, since it is actually the restriction of a morphism on 
P%, xz P?,. This completes the proof of Theorem 5.3. oO 


§6. Existence of Néron Models 


In this section we are going to prove the existence of Néron models for ellip- 
tic curves. The proof, which closely follows the exposition of Artin [1, §1], 
is largely scheme-theoretic and is at a more advanced level than the other 
material in this chapter. The reader who is willing to accept the statement 
of Theorem 6.1 should read Remarks 6.1.1-6.1.3 and can then proceed to 
the next section with no loss of continuity. 


Theorem 6.1. Let R be a Dedekind domain with fraction field K, 
let E/K be an elliptic curve, let C/R be a minimal proper regular model 
for E/K (4.5), and let E/R be the largest subscheme of C/R which is 
smooth over R. Then €/R is a Néron model for E/K. 


Remark 6.1.1. The generic fiber of © is the non-singular curve EF, so © 
has only finitely many singular fibers. Each fiber of C consists of one or 
more irreducible components, possibly with multiplicities (see §4), say 


Tp 
Cy = > Np iF pi- 
i=l 


Then € is formed by discarding from € all F,,’s with np; > 2, all singular 
points on each Fy;, and all points where the F,;’s intersect one another. 
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Remark 6.1.2. Continuing with the notation from (6.1.1), the group 
scheme €/R comes equipped with an identity element, which is an R-valued 
point o9 € E(R). The image oo(R) of the identity element is a curve on C 
which will intersect the fiber Cy at the point ao(p). Proposition 4.3 tells us 
that oo(p) will be a non-singular point of Cy, so it will lie on an Fy; hav- 
ing multiplicity np; = 1. The component of @, containing oo(p) is called 
the identity component of Cy. The scheme obtained by removing all non- 
identity components from the fibers of € is called the connected component 
(of the identity) of C. The image oo(R) of the identity element lies in €, so 
we can define the identity component of Ep and the connected component 
(of the identity) of € in an analogous manner. The connected component 
of € is a subgroup scheme of €; see exercise 4.25. We will see later (9.1) 
that it is isomorphic to the smooth part of the scheme defined by a minimal 
Weierstrass equation for EP. 


Remark 6.1.3. With notation as in (6.1), one can prove that the group 
law € xp E — E extends to give a group scheme action 


ExrC —€C. 


See exercise 4.23. We proved a special case of this in (5.3c). 


Before beginning the proof of (6.1), we want to say a few words explain- 
ing why the smooth part of a minimal proper regular model for E/K turns 
out to be a Néron model. In other words, how will we use the four proper- 
ties “smooth,” “minimal,” “proper,” and “regular”? First, the properness 
of € over R ensures that E(i,) = C(R) (4.4a). Next, the regularity of C tells 
us that every R-point lies in the smooth part of € (4.4b), so C(R) = E(R). 
This gives E(k) = €(R), which is an important case of the Néron mapping 
property (5.1.3). Thus the properness and regularity of C are mainly used 
to obtain the Néron mapping property. On the other hand, the smoothness 
and minimality of € are used to prove that € is a group scheme over R. 
In particular, the minimality implies that any K-automorphism of FE, such 
as a translation-by-P map for some point P € E(K), will extend to give 
an R-automorphism of €. These translation maps on € will be essential for 
showing that the group law on E'/K extends to give € the structure of a 
group scheme over R. 

With these preliminary comments completed, we begin the proof of 
Theorem 6.1. The first step in the proof is the following generalization of 
a theorem of Weil [3]. Weil’s theorem asserts that a rational map from a 
smooth variety to a complete group variety is automatically a morphism, 
and Artin [1, Prop. 1.3] has extended Weil’s theorem to a scheme-theoretic 
setting. 


Proposition 6.2. (Weil [3], Artin [1]) Let R be a Dedekind domain, 
let G/R be a group scheme over R, let X/R be a smooth R-scheme, and 
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let 6: X --+ G be a rational map over R. Write Dom(@) for the domain 
of @, and suppose that Dom(@) is dense in every fiber of X/R. 

(a) The complement X \ Dom(¢@) is a subscheme of X of pure codimension 
one. 

(b) If G is proper over R, then Dom(¢) = X. In other words, ¢ is a 
morphism. 


Proor. We will write (g,h) = gh and i(g) = g~!. Further, to simplify 


our exposition and help reveal the underlying ideas, we will phrase our 
argument in terms of points. But the reader should be aware that in 
order to be completely rigorous, our “points” should be T-valued points for 
arbitrary R-schemes T. As an alternative, the proof can be given purely 
scheme-theoretically, a task which we will leave for the reader. 

Having made this disclaimer, we begin by considering the rational map 


F:XxpX--->G,  F(x,y) = o(z)d(y)*. 
We claim that there is a natural identification 
Dom(¢) —+ AN Dom(F), xz (2,2), 


where A is the diagonal in X x zp X. To see this, take a point z € Dom(¢). 
Then F(z,2z) = ¢(x)¢(x)~1 is defined, so (z,x) € Dom(F). Conversely, 
if (x, x) € Dom(F), we can use the fact that Dom(£’) is open to find a non- 
empty open set U C X so that « xr U C Dom(F). Next, since Dom(¢) is 
open, we can find a point y € UM Dom(@). It then follows from 


o(x) = F(x, y) oly) 


that ¢ is defined at x, so x € Dom(@¢). This completes the proof of the 
claim. 

Let K(X x X) be the function field of the scheme X x p X, and let Ocg.o 
be the local ring of G along the identity section; that is, Og, is the ring of 
rational functions on G which are well-defined at some point on the image 
of the map oo : Spec(R) — G, where ao is the identity element of the group 
scheme G. 

The rational map F' defines a ring homomorphism 


F* :O0g9 — K(X x X), frofoF. 


Let f € Ogo. If x € Dom(¢), then (2,2) € Dom(F) from the claim 
proven above, and further F(z,x) = ¢(x)¢(x)~1 is the identity element 
of G, so F*(f) is defined at (x, x2). Conversely, if F*(f) = f o F is defined 
at (x,x) for all functions f € Og o, then F must be defined at (x, x). This 
proves that 


x € Dom(¢) => (a,x) € Dom(F* f) for all f € Ogo 
==> F*(Oao) C Oxxx(2,2) 


328 IV. The Néron Model 


where Ox xx (2,2) C A(X x X) is the local ring of X xz X at (2,2). 

The scheme X xp X is smooth over R, so in particular it is normal. 
This implies that a function f € K(X x X) will be defined at (a,x) unless 
its polar divisor div..(f) goes through (x, x). (This is a standard property 
of normal schemes. It follows, for example, from Hartshorne [1, II.6.3A].) 
In other words, the local ring Oxy x(2,2) can be characterized as 


Oxxx,(a,2)= {g € K(X x X)* : (a,x) ¢ divoo(g) }U{O}. 
Combining this with our description of the domain of ¢ from above yields 


X\ Dom(¢) = {# EX : F*(Ogo) £ Oxxx,(x,2)} 
= {EX : (a,x) € div.(F*f) for some f € Ogo} 


“An [J divo(F*f) 


fEOG,0 


= LU (Andiv..(F*f)). 


FENG. 


The diagonal A is a complete intersection in X x zg X, and each divisor 
div..(f*f) has pure codimension one in X xp X, so each of the intersec- 
tions AN div..(f*f) has pure codimension one in A. It follows that the 
union over f € Og also has pure codimension one in A, since we know 
a priori that it is a proper closed subset of A. This completes the proof 
of (a). 

(b) The following lemma (6.2.1) says that a rational map from a smooth 
scheme to a proper scheme is defined off of a subset of codimension at least 
two. Then (6.2.1) and (a) imply (b). 


Lemma 6.2.1. Let R be a Dedekind domain, let X/R be a smooth R- 
scheme, let Y/R be a proper R-scheme, and let ¢: X --» Y be a dominant 
rational map defined over R. Then every component of X ~ Dom(@) has 
codimension at least two in X. 


Proor. Let Z C X be an irreducible subscheme of codimension one in X. 
We need to show that ¢ is defined at the generic point of Z (i.e., ¢ is defined 
on a non-empty open subset of Z). Consider the local ring Ox z of X at Z. 
It is a local ring of dimension one, and it is regular since X/R is smooth, 
so it is a discrete valuation ring. 

The dominant rational map ¢: X --+ Y induces a morphism 


Spec K(X) — Spec K(Y) 


from the generic point of X to the generic point of Y. In other words, 
composition with # induces an inclusion of function fields K(Y) @ K(X). 
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This gives us the commutative diagram 


4 i y 


T 
Spec Ox. 2 | 
T 
SpecK(X) —> Spec K(¥). 


The discrete valuation ring Ox,z has fraction field K(X), and we are given 
that Y is proper over R, so the valuative criterion of properness (2.7) implies 
that the rational map 


ShecOpeoa aay 


extends to a morphism Spec Ox.z — Y. This says precisely that ¢ is defined 
at the generic point of 2, which completes the proof of (6.2.1), and with it 
also the proof of (6.2b). Oo 


We can use (6.2) to find the Néron model of an elliptic curve with 
good reduction. 


Corollary 6.3. Let R be a Dedekind domain with fraction field k, 
let E/K be an elliptic curve given by a Weierstrass equation 


y” + ayzy + agy = e+ anu? + a4qxv + a6 


having coefficients in R, and let W C P% be the closed subscheme of PR 
defined by this Weierstrass equation. Suppose that W is smooth over R 
or, equivalently, that the Weierstrass equation has good reduction at every 
prime of R. Then W/R is a Néron model for E/K. 


PrRooF. Theorem 5.3 says that the addition law on E/K extends to make 
W into a group scheme over the localization of R at each of its prime ideals. 
These group laws are given by the same equations, so they fit together to 
make W into a group scheme over R. It remains to verify that W has the 
Néron mapping property. 

Let X/R be a smooth R-scheme with generic fiber X/K, take any 
rational map ¢x : X/x --* E/x defined over K, and let ¢ : X --» W be 
the associated rational map over R. The fact that W is a closed subscheme 
of PZ, implies that it is proper over R (2.8), so we can use (6.2b) to deduce 
that the rational map ¢ extends to a morphism. This proves that W/R 
has the Néron mapping property, so it is a Néron model for E'/K. Oo 


Our next step is to prove that if the scheme € in (6.1) is a group 
scheme, then it will be a Néron model for £, at least provided that the 
ring R is large enough. The precise property we will require R to have is 
described in the following definition. 
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Definition. A discrete valuation ring R is called Henselian if it satisfies 
Hensel’s lemma; that is, R is Henselian if for any monic polynomial f(x) € 
R[x] and any element a € R satisfying 


f(a) =O(modp) and __f"(a) # 0 (mod p), 
there exists a unique element a € R satisfying 
a =a(mod p) and f(a) =0. 


The ring RF is called strictly Henselian if it is Henselian and if its residue 
field &k = R/p is algebraically closed. (Remember our residue fields are 
perfect. The usual definition of strictly Henselian requires k to be separably 
closed.) 


For example, if R is a discrete valuation ring, then the completion of R 
with respect to its maximal ideal p is Henselian. We have seen many in- 
stances in which it is helpful to work with complete discrete valuation rings, 
for example in our study of formal groups [AEC, IV §6] and the reduction 
theory of elliptic curves [AEC, VII §2]. In particular, we used Hensel’s 
lemma for complete discrete valuation rings to prove the surjectivity of the 
reduction map Eo(K) — Ens(k) in [AEC, VII.2.1]. 

However, for many purposes the completion is too large, since the 
completion of R will generally not be flat over R. The following general- 
ization of [AEC, VII.2.1] says that the reduction map is surjective for any 
Henselian ring. 


Proposition 6.4. Let R be a discrete valuation ring with maximal ideal p 
and residue field k, let X/R be a smooth R-scheme, and let X/k be its 
special fiber. Consider the reduction map 


X(R) — K(k). 


(a) If R is Henselian, then the reduction map is surjective. 
(b) If R is strictly Henselian, then the image of the reduction map is dense 
in X. 


PROOF. (a) Replacing X by an affine neighborhood, we can assume that 
X=SpecA with A= Rlty,...,tnar]/(fi,---, fr) 


for certain polynomials fi,...,fn € Riti,...,tn4,]. Further, the assump- 
tion that X is smooth over R means that the n x n minors of the Jacobian 


matrix a 
Ot; 1l<i<n 


1<j<ntr 
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generate the unit ideal in A. 7 - 
Choose any point b = (b1,...,6n) € X(k); that is, let b € A” (R) satisfy 


fi(b) =--- = fr(b) =0 (mod p). 
We need to construct a point 3 € A"(R) with the property that 
B=b(modp) and fi (8) =--- = fn(B) =0. 


For example, suppose that X is locally a hypersurface (i.e., m = 1), 
given by the single equation f(ti,...,én4,) = 0. Then the Jacobian con- 
dition says that the partial derivatives Of /Ot; generate A, so in particular 
one of the values (Of /0t;)(b) must be a unit in A, say (Of /0t1)(b) € A*. 
Thus the polynomial F(t) = f(t, bo,...,0n4,) satisfies the hypotheses of 
Hensel’s lemma, so it has a root 6; € R with 6, = 6;(modp). The 
point 6 = (31, b2,...,b,) € X(R) reduces modulo p to the original point b 
in X(k), which completes the proof in this case. We will leave the general 
case, which is somewhat more difficult, for the reader to complete, or see 
the references for exercise 4.27. 

(b) The residue field & is algebraically closed, so it is clear that X,(k) is 
dense in X,. Now (b) follows from (a). Oo 


As the next proposition explains, every discrete valuation ring R can 
be embedded in a minimal (strictly) Henselian ring. 


Proposition 6.5. Let R be a discrete valuation ring with maximal 
ideal p, residue field k, and fraction field K. Let K* be a separable closure 
for K, let R® be the integral closure of R in K*, and choose an ideal p* 
of K* lying above p. Let 


D= {o € Geek < a(p*) = p*}, 
IT={o€D: o(x)-x€ ps forall x € R*} 


be the associated decomposition and inertia groups. 

(a) Let RS(D) denote the subring of R® fixed by D, and define R® to be 
the localization of R°(D) at the maximal ideal pS N R8(D). Then R" is 
Henselian. It is called the Henselization of R. 

(b) Let R8(I) denote the subring of R® fixed by I, and define R*" to be the 
localization of R8(I) at the maximal ideal pS M R8(I). Then R*” is strictly 
Henselian. It is called the strict Henselization of R. 

(c) With the obvious notation, we have 


vl 


p? = pR, kh = k, i = HA, Kesh = 
Further the natural map 


Gigens ih mE Gisk 
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of Galois groups is an isomorphism. 


PRooF. (a) We are going to verify that R has the Henselian property. 
Let f(x) € R®[z] be a monic polynomial, which we may assume to be 
irreducible and separable over K™. (If f is inseparable, then f’(x) is iden- 
tically 0, so the Henselian property is vacuously true!) Factor f over K® 
as 

f(x) = (@ — a4) (@ — ag)+++ (a — aa). 


Suppose that a € R® satisfies f(a) € po and f’(a) ¢ p®. This implies that 
there is exactly one root, say a = a;, with the property that a—a € p*. 
Further, if o € D is any element of the decomposition group of p*, then we 
have 

a—o(a) =o0(a—a) € o(p*) = p®. 


But there is exactly one root of f which is congruent to a, so o(a@) must 

equal a. This shows that a is fixed by D, and hence a € R?. 

(b) The proof that Rs is Henselian is similar to the proof of (a), and it 

is a standard fact that the residue field is an algebraic closure of k; see for 

example Serre [4, I §7]. (Remember we are assuming that k is perfect.) 

(c) Again these are standard properties of Galois extensions of local fields. 
O 


Remark 6.6.1. It is clear from the construction (6.5) that the fraction 
fields K> and K** are separable algebraic extensions of K. Note that in 
general the fraction field of the completion of R will be transcendental 
over K, in fact, of infinite transcendence degree. This is one reason why it 
is often better to work with the Henselization. The moral is that one should 
work with Henselizations if one only needs to solve polynomial equations, 
but one has to go to the completion in order to use convergent power series. 
We also mention that strictly Henselian rings play the same role for the 
étale topology that local rings play for the Zariski topology. 


Remark 6.6.2. The Henselization can also be described in terms of a 
universal mapping property, which essentially says that it is the smallest 
Henselian extension of R, and similarly for the strict Henselization. See 
exercise 6.28. 


Proposition 6.7. Let R be a strictly Henselian discrete valuation ring 
with fraction field K, let E/K be an elliptic curve, let C/R be a minimal 
proper regular model for E/K (4.5), and let €/R be the largest subscheme 
of C/R which is smooth over R (6.1.1). If the group law on E/K extends to 
make € into a group scheme over R, then €/R is a Néron model for E/K. 


Proor. Let X/R be a smooth R-scheme with generic fiber X/K, and 
let Ox : X --» E bea rational map. In order to verify the Néron mapping 
property, we must show that ¢@x extends to a morphism X —> €. 
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The elliptic curve FE is a proper group scheme over K, and X is smooth 
over K, so applying (6.2b) with R = K to the map ox : X --» E, we see 
that @x% extends to a morphism. This means that the rational map 


b:X---+€ 


induced by ¢x is a morphism on the generic fiber. 

We suppose that ¢ is not a morphism and derive a contradiction. 
By assumption, € is a group scheme, so (6.2a) tells us that the set of 
points where ¢ is not defined is a set of pure codimension one in X. Hence 
there is an irreducible closed subscheme 2 C X such that ¢ is not defined 
at the generic point nz of Z. Note that the generic point of Z is given 
by nz = Spec Ox.z, and that the local ring Ox,z is a discrete valuation 
ring because X is regular and Z has codimension one. We now have the 
following picture: 


x cant € c e 
; 
nz = Spec Ox,z | | 
T 
Spec K(X) 25, SpecK(€) = Spec K(€). 


The scheme € is proper over R, and Ox.z is a discrete valuation ring, 
so the valuative criterion of properness (2.7) says that ¢ extends to a mor- 
phism ¢ : nz — C. In other words, if we map to € rather than to the smaller 
scheme €, then ¢ is defined generically on Z. 

We are assuming that ¢: X --» € does not extend generically to Z, 
or equivalently that (nz) € © is not contained in €. In particular, if 
we let k be the residue field of R and take any point xp € 2(k) so that 
@:X --+ C is defined at xo, then O(ao) ¢ E. (This is another way of saying 
that @: X --» € is not defined generically on Z.) 

The set of R-valued points X(R) maps to a dense set of points in the 
special fiber of X by (6.4b). Note that this is where we use our assumption 
that R is strictly Henselian. In particular we can find a point x € X(R) 
which intersects Z at a point, call it x9 € Z(k), at which the map @: X --+ C 
is defined. Composing zx with ¢ gives a rational map 


Spec(R) > X-%3e 


which by the valuative criterion of properness (2.7) extends to a morphism 
Spec(R) — ©. In other words go x € C(R), and by our construction it 
is clear that @oax ¢ E(R). However, (4.4b) says that C(R) = €(R). This 
contradiction completes the proof that ¢ extends to a morphism X — €, 
and hence that € has the Néron mapping property. Oo 
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In order to complete the proof of Theorem 6.1 for strictly Henselian 
rings, it remains to show that the scheme € is a group scheme over R. This 
is done in two steps. First, we prove that there exists some group scheme A 
over R such that € and A have isomorphic dense open subsets. Second, 
using the group operation on A, we show that € and A must be isomorphic. 
The proof of the first part uses an argument of Weil to construct a group 
variety (or scheme) by pasting together group chunks. We will not give 
the full proof of Weil’s result but will be content to give a brief sketch and 
refer the reader to Artin [1, §2] for the details. 


Definition. Let R be a Dedekind domain, let V/R be a smooth R-scheme 
with non-empty fibers, and let 


wi VxRV---+V 


be a rational map defined over R. The map yp is called a normal law on V 
if it satisfies the following two conditions: 


(i) The map yp is associative; that is, 
u(u(x.y),2) = w(x, u(y, z)) whenver both sides are defined. 


(ii) Define rational maps 


o:V XR V---> VxrVv w:iV¥ xp V---> VxrVv 
(x,y) > (x, u(x, y)), (x,y) > (y,u(z,y)). 


Then the domains of definition of ¢ and y contain a dense subset of 
each fiber of V xz V, and the restriction of @ and w to each fiber is a 
birational isomorphism. 


Remark 6.8. If G is a group scheme over a Dedekind domain R, then its 
group law is a normal law. Condition (i) is true because the group law 
on G is associative by definition, and condition (ii) is immediate since ¢ 
and yw are isomorphisms. For example, the inverse of w is the map 


GxrG—GxrG, (x,y) > ((u(y, i(x)), 2). 


For a general normal law yp, the requirement that the rational maps ¢ 
and y satisfy condition (ii) provides a sort of inverse for the hidden group 
law that yu is trying to emulate. 


The following theorem says that a normal group law on V makes a 
large chunk of V into a large chunk of a group scheme. 


Theorem 6.9. (Weil [3]) Let R be a Dedekind domain, and let V/R be 
a smooth R-scheme of finite type over R. Suppose that every fiber of V is 
non-empty and that V(R) is dense in each fiber. 
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Let be a normal law on V. Then there exists a group scheme G/R 
of finite type over R, an open subscheme U/R Cc V/R, and an open sub- 
scheme U'/R C G/R with the following two properties: 

(i) U and U' are dense in every fiber. 
(ii) There is an R-isomorphism U = U' so that the normal law js restricted 
to U coincides with the group law of G restricted to U’. 


ProoF. (Sketch) The underlying idea is to start with a good open sub- 
scheme U of V and construct G as a union of translates of U. More pre- 
cisely, for each x € U(R), let U, be a copy of U. Then one treats Uz as if 
it were “U translated by x” and uses py to provide gluing data to attach U; 
to U. For further details, see Artin [1, Thm. 1.12]. oO 


The next result, combined with (6.7), will complete the proof that €/R 
is a Néron model for E/K, at least over strictly Henselian rings. 


Proposition 6.10. Let R be a strictly Henselian discrete valuation ring 
with fraction field K, let E/K be an elliptic curve, let C/R be a minimal 
proper regular model for E/K (4.5), and let E/R be the largest subscheme 
of C/R which is smooth over R (6.1.1). Then the group law on E extends 
to make € into a group scheme over R. 


ProorFr. The proof consists of two steps. First, we verify that the group 
law on F defines a normal law on €. This allows us to apply (6.9), which 
yields a group scheme G/R that is birational to €. The second step is to 
show that the resulting birational map € — G is actually an isomorphism. 
Notice that the proof is somewhat indirect. Rather than proving that the 
group law on FE extends to €, we instead construct an auxiliary group 
scheme G which extends the group law on EF, and then we show that G 
must equal €. 

We begin with the assertion that the group law on F defines a normal 
law 


pi ExXpE ---4 €E. 


The associativity of jz is clear, since jz is associative on the generic fiber 
of €, and a rational map is determined by its restriction to any dense open 
subset. So it remains to verify that the maps 


@:EXRE---4 EXRpE w:rE xXpE---9 EXRE 
(x,y) +> (x, u(z,y)), (z,y) => (y,n(x,y)). 


are defined on a dense subset of the special fiber and that their restrictions 
to the special fiber are birational isomorphisms. 

Suppose that R’ is any discrete valuation ring which is the localization 
of a smooth R-scheme, and let K’ be the fraction field of R’. Then the 
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minimal proper regular model of E over the field K’ is C x p Spec(R’). We 
apply this fact, taking R’ to be the ring 


R=0c-= the local ring of E at a generic 
£36 point € of its special fiber € 


Note that the special fiber € need not be irreducible, so it may have several 
generic points, one for each component. We can take R’ to be the local 
ring at any one of these generic points. Note also that here is where we 
use the fact that € is smooth over R, since this fact implies that R’ is the 
localization of a smooth R-scheme. 

There is a natural map Spec(R’) = € — €, in other words an R’-valued 
point of €. We let 7 be the corresponding translation map, 


7: Spec(R’) xp € — Spec(R’) xp E. 


This translation map is an automorphism on the generic fiber, so it fol- 
lows from (4.6) and the minimality of C xp Spec(R’) that 7 is actually a 
morphism. 

The map 7 on Spec(R’) xp E is translation on the second factor by 
the generic point of the first factor, so we obtain a commutative diagram 


Spec(R')xz& —> Spec(R’) xz & 


| | 


exe ss ER 

This proves that @ is defined at every generic point of the special fiber 
of € Xp € lying over €. But € is an arbitrary generic point of the special 
fiber €, which implies that ¢ is defined at every generic point of the special 
fiber of Ex €. A similar argument can be applied to w, which proves that 
the domains of definition of ¢ and yw contain a dense subset of the special 
fiber of E Xp €. This verifies the first part of property (ii) in the definition 
of a normal law. 

Now take any point P € €(R), and let tp : € — E be the automor- 
phism of € extending the translation-by-P morphism on the generic fiber 
of F (4.6). Then the map 


B= Speen) spe. = “OC xyt: “2 -exné 
is precisely the map Px Tp, so it is one-to-one. It follows that the fibers of @ 
are not all positive dimensional. Therefore ¢ must be generically surjective, 
and hence a birational isomorphism on the closed fiber of € x gE. Similarly 
for yw, which completes the verification that yz: is a normal law on E. 
We can now apply (6.9) to deduce the existence of a group scheme 
G/R, open subschemes U/R c € and U'/R C G/R which are dense in 
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every fiber, and an R-isomorphism U = U’ so that the restriction of pp to U 
coincides with the group law of G restricted to U’. The isomorphism U = 
U’ extends to give an R-birational map \: G — €, and the proof of (6.10) 
will be complete if we can show that 4 is an isomorphism. 

As described in (3.3), any point P € G(R) defines a translation mor- 
phism tp : G — G. The generic fiber of G is E/K, so P gives a point 
in E(K). Translation-by-P on E/K is an automorphism of the generic 
fiber of €. It follows from (4.6) and the minimality of € that this induces 
a translation map on €, which we will also denote by tp : € — €. Now 
if g € G is any point, then we can find a P € G(R) so that tp(g) € U’. 
(Note R is strictly Henselian, so G(R) maps to a dense subset of the spe- 
cial fiber of G from (6.4b).) Then we can extend the definition of \ to a 
neighborhood of g by using the fact that A = T_po Ao7Tp at every point 
where it is defined, since the right-hand side is clearly defined at g. This 
proves that is a morphism. 

On the other hand, the map ’~! : € > G in the opposite direction is 
a rational map from a scheme smooth over R to a group scheme over R. 
Suppose that A~! is not a morphism. Then Weil’s theorem (6.2) tells us 
that there is an irreducible curve Z C € such that A~! is undefined at the 
generic point 7, of Z. But we know that A7! is defined on U, and U is 
dense in the special fiber, so Z cannot be a component of the special fiber. 
It follows that 7, is contained in the generic fiber E of €. In other words, 
there is a point of E at which A~! is not defined. But on the generic fiber, A 
is the identity map E — E. This contradiction shows that A~1 is defined 
everywhere on €. Therefore \ is an isomorphism, and hence € is a group 
scheme over R. Oo 


We now have all of the tools needed to prove the existence of Néron 
models for elliptic curves over Dedekind domains. 


PROOF (of Theorem 6.1). If the ring R is strictly Henselian, then combin- 
ing (6.7) and (6.10) shows that € is a Néron model for E/K. There are 
two more steps needed to complete the proof of (6.1). First, we have to 
descend from the strict Henselization of a discrete valuation ring down to 
the ring itself. Second, we have to glue together Néron models over discrete 
valuation rings to create a Néron model over a Dedekind domain. 

So suppose first that R is a discrete valuation ring, and let RS" be the 
strict Henselization of R (6.5). Then @** = @ xp R° will be a minimal 
proper regular model for E/R*". To see this, we note first that proper 
morphisms are stable under base extension (Hartshorne [1, IT.4.8c]), so C*? 
is proper over R®", Next, the regularity of C*" follows, since @ is regular 
and R® is flat and unramified over R. Finally, the minimality of CS is 
a consequence of the construction of the minimal proper regular model 
in terms of a regular model with all exceptional curves blown down. See 
Lichtenbaum [1], Shafarevich [1], Chinburg [1], and the discussion in §7 of 
this chapter. 
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Letting €S* = € xp R®, it is clear that €° is the largest open sub- 
scheme of CS which is smooth over R®", so our previous work implies 
that €8>/RS" is a Néron model for E/K*". We are going to use this fact 
to verify that € has the Néron mapping property over R. 

Let X/R be a smooth R-scheme with generic fiber X/K, and let dx : 
X;K — Ey« be a rational map defined over AK. Consider the extension 
of X and dx to R®, say X8* =X xp R and d§ : XG coh — Ejxen. The 
scheme X** is smooth over R*" from Hartshorne (1, I[I.10.1b] or Altman- 
Kleiman [1, VIL.1.7], so the Néron mapping property for €*" tells us that o5} 
extends to a unique morphism ¢ : Xa = CFh This gives us the 


commutative diagram sh 
ysh  2R,  gsh 


| | 


Oe es ae 

where the top row is obtained from the bottom row using the base extension 
Spec R®" — Spec R. The strict Henselization R*" is faithfully flat over R 
(Bosch-Liitkebohmert-Raynaud [1, 2.4, corollary 9]), so this is exactly the 
situation in which we can apply faithfully flat descent (see, e.g., Bosch- 
Liitkebohmert-Raynaud [1, Chap. 6] or Milne [4, I §2]) to conclude that 
the rational map on the bottom row is a morphism. Therefore €/R has the 
Néron mapping property, which concludes the proof that €/R is a Néron 
model for F/K in the case that R is a discrete valuation ring. 

Finally, suppose that R is a Dedekind domain. From what we have 
already done, we know that for each prime p € Spec R, the localization 
€ Xp Ry is a Néron model for E over Ky. Further, (6.3) tells us that if we 
fix a Weierstrass equation W/R for E/ i, and if we let S C Spec R be the 
set of primes for which W has bad reduction, then W x pr Rg (i.e., the part 
of W lying over Rg) is a Néron model for E over Rg. This gives the Néron 
model over a dense open subset of Spec R, and gluing this large piece to 
the finitely many bad fibers produces a Néron model over all of Spec R. 
This completes the proof of (6.1). i) 


87. Intersection Theory, Minimal Models, and Blowing-Up 


In Chapter III we saw amply demonstrated the power of intersection theory 
as a tool for studying the geometry of surfaces. In this section we will 
describe, without proof, the analogous theory on arithmetic surfaces due 
to Lichtenbaum [1] and Shafarevich [1]. Unfortunately, the fact that an 
arithmetic surface is not complete means that it is not possible to define 
an intersection theory on the full divisor group, but we will be able to 


§7. Intersection Theory, Minimal Models, and Blowing-Up 339 


compute intersections with divisors which lie on the special fiber. In the 
next section we will use intersection theory to completely describe all of 
the possible special fibers for a minimal proper regular model of an elliptic 
curve. We will work over a discrete valuation ring, rather than a Dedekind 
domain, since everything we do in this section can be done fiber-by-fiber. 

Let R be a discrete valuation ring with maximal ideal p and residue 
field k = R/p, and let C/R be an arithmetic surface over R. The scheme C 
is normal by definition, so there is a good theory of Weil divisors on C€ 
as described in Hartshorne [1, II §6]. An irreducible divisor I on € is a 
closed integral subscheme of dimension one, in other words a curve, and 
the divisor group Div(C) of € is the free abelian group generated by the 
irreducible divisors. Further, each non-zero function f € K(C) defines a 
principal divisor 


div(f) = $o ordr(f)P € Div(e), 
Cr 


and as usual we say that two divisors are linearly equivalent if the differ- 
ence is principal. Here ordy is the normalized valuation associated to the 
irreducible divisor I’; see (4.1.2). 

Let [ € Div(C) be an irreducible divisor and let x € Cy be a point on 
the special fiber of C. Informally, a uniformizer for T at x is a function 
which vanishes to order 1 along [ and has no other zeros or poles in a 
neighborhood of x. More precisely, a uniformizer for T at x isa function f € 
Oe,« in the local ring of C at x with the property that 


ordr(f) =1, and ordy(f) = 0 for all irreducible IY AT with x € I’. 


To see that such a function exists, we need merely note that if x € TI, 
then Oe, is a discrete valuation ring containing the integrally closed local 
ring Oe 2. 


Definition. Let ['},[2 € Div(C) be distinct irreducible divisors and let 
x € © be a closed point on the special fiber Cy of ©. Choose uniformiz- 
ers fi, fo € Oe, for [1,TP2 respectively. The (local) intersection index 
of 1; andT2 at x is the quantity 


(Ti -T2)2 = dim, Oe,2/(fi, fa). 


Notice that this is the same definition that we gave in (III §7) for 
the local intersection index on geometric surfaces. Just as in Chapter III, 
we would like to add up these local intersection indices to get a global 
theory. Further, we would like our intersection theory to have the important 
functorial properties described in (III.7.2). In particular, linearly equivalent 
divisors should give the same intersection index. Unfortunately, it is not 
possible to define an intersection theory on Div(C) with this property. The 
problem is that C is not complete. This is true even if € is proper over R, 
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which will ensure that the fibers are complete, because the base Spec(R) 
itself is not complete. This non-completeness means that it is possible to 
use a linear equivalence to move an intersection point “out to infinity,” 
where it then disappears. The following simple example illustrates this 
difficulty. 


Example 7.1. Let C = P = Proj R[X,Y], and consider the two divisors 
Tr, = {xX =0} and Tyg ={X+7"Y = 0}. 


Here a € R is a uniformizer for the maximal ideal p of R, and n > 1 is an 
integer. These two divisors intersect at the point 


e={X=nr=0}€C,=Ph 


on the special fiber. To compute the local intersection index we dehomog- 
enize by setting Y = 1 and then compute 


(11 : T2)e = dim; R[X] (x) /(X,X + nT) = dim; R/(x”) =n. 
The divisor [2 is linearly equivalent to the divisor [3 defined by 


eye 
TP; =To4+div (a) ={Y¥ =0}. 


Notice that [, and [3 have no points in common. So the linear equiva- 
lence Ty ~ T'3 has caused the intersection point of [, and [2 to disappear. 


The preceding example gives us two options. Either we can drop the 
requirement that intersections be invariant under linear equivalence, or we 
can restrict the allowable divisors. As we will see in the next section, it 
is extremely important to be able to compute the intersection of a divisor 
with itself, and to do this we need to be able to move the divisor in some 
way while not changing total the intersection index. So we will adopt the 
second alternative and restrict the set of divisors. 

The irreducible divisors on an arithmetic surface C/R come in two 
flavors. First, there are the components of the special fiber, as described 
in §4. Second, if I c € is an irreducible divisor which does not lie in the 
special fiber, then the map I — Spec(R) will be surjective. An irreducible 
divisor which is a component of the special fiber is called a fibral divisor, 
and an irreducible divisor which maps onto Spec(R) is called a horizontal 
divisor. For example, the image o(Spec R) of a section o € C(R) is a 
horizontal divisor. 


Definition. A divisor D = )°\n,J; is called fibral if every component I; 
of D is a fibral divisor. The group of fibral divisors on C is denoted 


Div,(C) = {D € Div(€) : D is fibral}. 


Notice that Div,(C) is a subgroup of Div(C). 
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Definition. A divisor D = }>n,J; is called positive if every n; > 1. The 
set of positive divisors clearly does not form a group. 

The fibral divisors are fairly rigid, at least in the sense that their 
intersections generally cannot be moved off of the special fiber by a linear 
equivalence. This makes them suitable for intersection theory as described 
in the following result. 

Theorem 7.2. Let R be a discrete valuation ring, and let C/R be a 
regular arithmetic surface which is proper over R. There is a unique bilinear 
pairing 

Div(C) x Div,(C) — Z, (D,F)+-> D-F, 
with the following properties. 
(i) IfT € Div(C) and F € Div,(€) are distinct irreducible divisors, then 


Poh Soar a 
zernF 
(ii) If D}, D2 € Div(C) and F € Div,(C) are divisors with D linearly 
equivalent to Do, then D, - F = D2- F. In particular, 
div(f)-F =0 for all f € K(€)* and all F € Div,(€). 


The intersection pairing also has the following symmetry property. 
(iii) If F, F2 € Div,(C) are fibral divisors, then F, - Fy = Fo - F,. 


Proor. Just as in the geometric case (III.7.2), the main idea is to use 
linearity and the linear equivalence property (ii) to reduce the computation 
of D-F to the case of distinct irreducible divisors, and then apply (i). The 
principal difficulty, as always, is to show that the result is independent of 
the various choices made. For details, see Lichtenbaum [1], Shafarevich [1], 
Lang [6, HI §§2,3], or Chinburg [2, §4]. oO 


Remark 7.2.1. Let C = PL = Proj R[X,Y], let a € R be an element 

which is not a square in R, and consider the two irreducible divisors 
Peq{qx*=aY*} and FHey=Pi: 

Then I'M F consists of either one or two points, depending on whether or 

not a is a square in the residue field k. If a ¢ k, then T and F intersect 


at the one point 
z={n=X?—aY? =0} EG, 


where 7 is a uniformizer for R. We compute the intersection index at x by 
dehomogenizing Y = 1, 


([- F), = dimg R[X](x2~a)/(", X? — a) = dim, k[X]/(X? — a) = 2. 
Similarly, if \/a € k, then T and F intersect at the two points 
y={r=X-VaY =0} and z={nm=X+VaY =O}. 
We will leave it to the reader to check that (T- F), = (T- F), =1. 
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We proved (III.8.2) that the intersection pairing on a fibered surface 
is negative semi-definite on fibral divisors, with kernel the entire fiber. The 
argument given in Chapter III carries over almost verbatim to give the 
same result for arithmetic surfaces. 


Proposition 7.3. Let R be a discrete valuation ring with maximal 
ideal p, and let C/R be a regular arithmetic surface proper over R. 
(a) The special fiber C, is connected. 
(b) Let F € Div,(C) be a fibral divisor. Then F? < 0, and the following 
three conditions are equivalent: 
(i) F? =0. 
(ii) F- F’ =0 for every F" € Div,(C). 
(iii) F = aC, for some a € Q, where Cy = © xp p is the special fiber 
of © with appropriate multiplicities; see §4. 


PROOF. (a) This is a special case of Hartshorne [1, III.11.3]. 

(b) Clearly, (ii) implies (i). Further, the divisor C, is principal, since 
it is equal to div(7) for a uniformizer 7 € R, so (7.2ii) shows that (iii) 
implies (ii). The fact that F? < 0 and the remaining implication (i) > 
(iii) are proven in exactly the same way as the geometric case (III.8.2). 
For further details, see Lichtenbaum [1], Shafarevich [1], or Lang (6, III 
Prop. 3.5]. oO 


To simplify our discussion for the remainder of this section, we are 
going to assume that our discrete valuation ring R has an algebraically 
closed residue field k. In practice, most of what we say will remain true 
with some slight modifications. 

With this assumption, an irreducible fibral divisor Ff’ on € is an irre- 
ducible curve defined over k. If we further assume that © is proper over R, 
then F will be proper (hence projective) over k. Recall that the arithmetic 
genus of such a curve F'/k is defined to be 


PalF) => dim; H'(F, Op). 


More generally, if we write the special fiber as C, = S>n,Fj, then any 
positive fibral divisor F = S>a;F; is a one-dimensional scheme over k. 
If F’ is connected, the arithmetic genus of F’ is defined by exactly the 
same formula. See Hartshorne [1, exercise III.5.3] for a discussion of the 
arithmetic genus. The next proposition describes the few facts that we will 
need to know about the arithmetic genus. 


Proposition 7.4. Let R be a discrete valuation ring with maximal 
ideal p, fraction field K, and algebraically closed residue field k. Let C/R 
be a regular arithmetic surface proper over R. 
(a) (Adjunction Formula) There is a divisor Ke € Div(C) with the property 
that 

F? + Ke: F =2p,(F)—2 for every F € Div,(€). 
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The divisor Ke is called a canonical divisor on ©. (N.B. The adjunction 
formula is only valid for fibral divisors.) 
(b) Let C/K be the generic fiber of ©. Then 


Pa(€p) = Pa(C) = g(C), 


where g(C) is the usual genus of C/K [AEC, II.5.4]. 
(c) Let F € Div,(C) be an irreducible fibral divisor. Then pa(F) > 0, 
and pa(F) = 0 if and only if F is isomorphic to Pj. 


Proor. (a) The classical adjunction formula for a non-singular curve on 
a non-singular surface is proven in Hartshorne [1, V.1.5], and the case of 
singular curves is described in Hartshorne [1, exercise V.1.3]. The adjunc- 
tion formula for arithmetic surfaces is due to Lichtenbaum [1, Thm. 3.2]. 
For further information, see also the discussion in Lang [6, remark 1 on 
p. 117]. 

(b) The first equality follows from the general fact that in a flat family, the 
arithmetic genus of the fibers remains constant (Hartshorne [1, II.9.10]). 
The second equality is Hartshorne [1, IV.1.1], since C is non-singular. 

(c) The inequality pz(f’) > 0 is clear, since by definition the arithmetic 
genus is the dimension of a certain cohomology group. For the second 
assertion, see Hartshorne [1, IV exercise 1.8(b)]. Oo 


Remark 7.4.1. The description of the canonical divisor Ke in (7.4a) is 
not, of course, the usual definition. A canonical divisor is normally defined 
to be the divisor of a differential form of top dimension, so in the case of 
an arithmetic surface C@, the divisor of a differential 2-form on C. But for 
our main application (8.1) in the next section, we will only need to know 
that there exists some divisor satisfying the adjunction formula (7.4a). In 
fact, it would suffice to know that the map 


Div,(C) — Z, F +> 2p,(F) -2— F’, 


is a homomorphism; see Lichtenbaum {1, Thm. 3.2]. 


Earlier (4.5) we stated the existence of a minimal proper regular model 
for a curve C/K. We now want to briefly describe how such models are 
constructed and give Castelnuovo’s criterion for minimality. If C/R is any 
regular model for C/K, and if x € Cy is a point on the special fiber of €, 
then we can blow up z to get another regular model €’/R and a birational 
morphism 

o:@e’ —+€. 
(See Hartshorne [1, I §4, II §7, V §3] and the discussion (7.7, 7.7.1) at the 
end of this section.) The map ¢ is an isomorphism away from z, and the 
inverse image of z is a divisor D = ¢~!(x) with the property that 


D=P, and D?=-1. 


344 IV. The Néron Model 


Thus blowing up zx has the effect of replacing x by a projective line whose 
self-intersection is —1. 

In general, an irreducible fibral divisor D € Div,(C) which satis- 
fies D = Pj and D? = —1 is called an exceptional divisor or an exceptional 
curve. Castelnuovo showed (in the geometric setting) that such curves can 
always be blown back down. 


Proposition 7.5. (Castelnuovo’s criterion) Let R be a discrete valua- 
tion ring with fraction field K and algebraically closed residue field k, and 
let C/K be a non-singular projective curve of genus g > 1. 

(a) Let @'/R be a proper regular model for C/K (4.5a), and let D € 
Div,(C’) be an exceptional divisor on C’. Then there exists a proper regular 
model C/R for C/K and a birational morphism @¢ : C’ > € so that c = ¢(D) 
is a single point and ¢ is the blow-up of € at zx. 

(b) Let © be a minimal proper regular model for C/K. Then © contains 
no exceptional divisors. 


PROOF. (a) See Hartshorne [1, V.5.7] for a proof in the geometric situation. 
The arithmetic version is due to Lichtenbaum [1, Thm. 3.9] and Shafare- 
vich [1, p. 102], see also Chinburg [2, Thm. 3.1]. 

(b) If © contains an exceptional curve, then (a) says that we can blow 
it down to get a smaller regular model for C/K. But a blow-down map 
is clearly not an isomorphism, which contradicts the assumed minimality 
of €. a 


Remark 7.5.1. Continuing with the notation from (7.5b), it can be shown 
that a proper regular model C/R for C/K is minimal if and only if it con- 
tains no exceptional divisors. Further, starting with any proper regular 
model for C/R, one can produce a minimal model by blowing down ex- 
ceptional curves until none are left. The geometric case is described in 
Hartshorne [1, V §§3,5]. The arithmetic case is due to Lichtenbaum [1, 
Thm. 4.4] and Shafarevich [1, p. 126]; see also Chinburg [2, Thm. 1.2]. 


Remark 7.6. Let R be a Dedekind domain, say the ring of integers of a 
number field AK, and let C/R be an arithmetic surface. Then we can define 
an intersection pairing 


Div(C) x Divgp(C) — Z, 


where Divg,(C) denotes the group of divisors generated by the components 
of the special fibers and the pairing is defined linearly using (7.2). Unfor- 
tunately, if we want to retain the linear equivalence property (7.2ii), then 
it still is not possible to extend this pairing to all of Div(C). As before, 
the underlying problem is that Spec(R) is not complete, so intersection 
points can move out to infinity and disappear. Arakelov [1] had the bril- 
liant idea of adding in some extra fibers “at infinity.” More precisely, he 
adds one fiber for each archimedean absolute value of AK, and then uses 
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tools from differential geometry to define real-valued local intersection in- 
dices on these archimedean fibers. Arakelov’s intersection theory extends 
to the full divisor group while retaining the linear equivalence property, 
and many of the most important theorems from the classical geometry of 
surfaces, such as the Riemann-Roch and adjunction formulas, extend to 
the Arakelov setting. For more information about Arakelov intersection 
theory on arithmetic surfaces, see for example Chinburg [1], Faltings [1], 
or Lang [6]. 


The final topic we want to discuss in this section is the blowing-up 
process. This is described with varying degrees of generality in Hart- 
shorne [1, I §4, II §7, V §3]. With an eye towards the explicit computations 
we will be doing in §9, we offer the following brief primer on blowing-up 
surface singularities. 


Remark 7.7. (Blowing-Up Singularities on Arithmetic Surfaces) Let R be 
a discrete valuation ring with uniformizing element 7 and residue field k. 
Let © c A?, be an arithmetic surface defined by a single equation 


f(x,y) =0 for some polynomial f(z, y) € Riz, y]. 


In other words, C = Spec R[x, y]/(f). In order to ensure that © is a two- 
dimensional scheme whose special fiber has dimension one, we will assume 
that f is not a constant polynomial and that at least one coefficient of f 
is a unit in R. In fancier terminology, this means that C€ is flat over R. 

Keep in mind that © is a “surface” (i.e., a two-dimensional scheme) 
sitting inside the three-dimensional scheme A. Intuitively, the three “co- 
ordinate functions” on A? ® are 7, x, and y, and in order to calculate the 
special fiber we always set 7 = 0. 

We are going to assume that C has a singularity at the point 7 =z = 
y = 0 on the special fiber. In other words, we assume that 


of 


(0,0) = 54 (0,0) = 3 


—(0,0)=0 (mod z). 


Let m = (m,2z,y) € © be the singular point on the special fiber of C. 
Then the blow-up of © at m is formed by taking the following three schemes 
and gluing them together as explained below. 


Chart 1. Define new variables 
C= TL, and yY=T7Y1, 
and let v be the largest integer so that 
f(r21,7) =m" fi(zi,yi) with fi(vi,y1) € Rlxi, yi]. 


In other words, factor out a power of 7 so that the coefficients of f; are 
in R and at least one coefficient is a unit. Then the first coordinate chart 
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for the blow-up of € at m is the scheme C; C A? = Spec R{x1, y:] defined 
by 
C; : Spec Rar, y1)/(fi(21, y1))- 


Chart 2. The second chart is formed using new variables x’, x’, y’ defined 
by 
nay’, c= oy’, y=y’. 

We substitute these into the polynomial f(x,y). This means we do two 
things. First, we replace x and y by a’y’ and y’. Second, we take each 
coefficient a of f(x,y) and replace the largest power of 7 dividing a by that 
power of z’y’. For example, if 7?|a and 7° + a, then we would replace a 
by (x’y')?x~a. We factor out the largest possible power of y’ to get 


fly y)=W) Fey’) with f’(a’,y') € Rin’, a',y'], 
and then the second coordinate chart of the blow-up is the scheme 
C’: Spec R[x’, 2’, y'|/(a — a'y’, f(a’, y’)). 


Note that 7’ is a new variable, just like the variables x’ and y’. The 
scheme €’ is the closed subscheme of A?, = Spec R[n’, x’, y’] defined by the 
two equations 7 = n'y’ and f'(2’,y’) =0. 

Chart 3. The third chart is formed similarly to the second chart using the 
variables 1", x’", y"” defined by 


qa aa", A = x", y= ya". 


Substituting these into f(x,y) as explained above and pulling out the 
largest power of x” gives 


f(a”, y"2") = (0 \”" f(x", y"") with f(a", y”) E Rio”, y”]. 
Then the third coordinate chart of the blow-up is the scheme 
e” : Spec Rin”, x", y|/(r _ mw ag, f(a y")). 


It is easy to see how to glue the three coordinate charts together. For 
example, in order to map C€, to @’, we just need to solve for (7’, x’, y’) in 
terms of (7,21, y1). Thus 

ee Tw 1 , & cy : 
Ts ae te A A Y =YrTyY1- 
y y Y1 7] y U1 
These equations define a birational map C; — C’ which is defined every- 
where except at the points of C; with y; = 0. Similarly, we get a birational 
map C, — @” by using the equations 


T 1 ii 
TST FS7-FST]; gL =X=7WT, Us SSS a 
x 
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and a birational map C@’ > C@” using 


These maps are used to glue the three coordinate charts together, and the 
resulting scheme is the blow-up of € at m. 

In order to find the special fiber of the blow-up, we take the special 
fibers of each of the coordinate charts and then glue them together. The 
special fiber of a coordinate chart is calculated by setting 7 = 0 and looking 
at the resulting curve defined over k. The first coordinate chart is easiest, 
and we find that 5 , 

Ci = Spec k[z1,91]/(fi(21, 91))- 


In other words, @; is the curve in A? defined by the single equation f; =0. 

Similarly, the special fiber of @’ is obtained by setting 7 = 0, which 
means that 7 

C! = Spec k[r', 2", y/]/(r'y', f’(2’,y')). 

Here x’ is to be treated as a variable, so @’ consists of two pieces, one 
obtained by setting x’ = 0 and the other obtained by setting y’ = 0. Of 
course, each piece may consist of several components, or a piece could be 
empty. Finally, C” is given by 


e” = Speck[r” 2" ,y"\/(n"'2", f(x", y")), 


so @” also consists of two pieces, one with 7” = 0 and the other with x” = 0. 


Example 7.7.1. We are going to illustrate (7.7) by blowing-up the arith- 
metic surface 
C:2274y=n4 

at its singular point 7 = x = y = 0. For simplicity, we will assume that 
the residue field does not have characteristic 2, 3, or 5. 

To find the first coordinate chart of the blow-up, we substitute 2 = 72, 
and y = my, into the equation for @ and cancel ?, which yields 

Cy: a2 t+ ny? = 1. 

The second coordinate chart is obtained by substituting 7 = n'y’, x = 2’y’, 
and y = y’, and then canceling y’ * to obtain 


2 3 4 72 
Cia’ ty any”, w=T'y’. 


Finally, to get the third chart we substitute m = 72", © = 2”, and y = 


2 
yc" and cancel x” to get 


3075 4 2 
eC’. 1 4+ a" y” = 7x" _ t= rg". 
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Next we compute the special fibers by setting 7 = 0. The special fiber 
of C; is 7 
Ci: x? = 0, 


so @; isa non-singular rational curve appearing with multiplicity 2. 

To find the special fiber of C’, we must set 7 = 1’y’ = 0. This gives us 
two pieces, one with 7’ = 0 and one with y’ = 0. In this way we find two 
fibral components, which we will denote by Fj and F}: 


af Fir =0, 2% +y"=0, 
2F3:y' =0, a” =0. 


Keep in mind that F{ and F} are curves in A} = Speck{z’, 2’, y’]. Thus Fj 
is a rational curve with a cusp, whereas F% is a non-singular rational curve 
which appears with multiplicity 2 in the fiber. In other words, as a divisor 
we have @! = F/ + 2F%. 

Similarly, the special fiber of C” is obtained by setting = 1’2” = 0. 
However, when we set x” = 0 we obtain the equation 1 = 0, so x” = 0 does 
not give any components of @”. Hence @” consists of the single rational 
curve 

Cl sgl = 0,1= gly”, 

We claim that when we glue €,, €’, and €” together, the special 
fiber @” is identified with F/, and the special fiber @ is identified with 2F3. 
To verify the first statement, we observe that the special fiber C” is defined 
by the equations 7” = 0 and 1 = xy". According to (7.7), the gluing 
map @’ — @” is given by the substitutions 


rl = n/a’, gl = z'y’, y" = 1/2’. 


Substituting these into the equations for @” yields 
n'/2'=0 and 1 = (2'y’3(1/2')® = y"?/2”, 


which are exactly the equations of F{. We leave it to the reader to verify 
the assertion that C, is glued to 2F%3. 

Thus the special fiber of @’ contains all of the components of the special 
fiber of the blow-up. If @’ were regular, we would have completed our 
construction of a regular model, but unfortunately it is not regular. In 
fact, every point on the component F 3 of the special fiber is a singular 
point, so next we want to blow up @’ along the entire curve y’ = 0. We 
didn’t discuss blowing up a surface along a curve in (7.7), but the procedure 
is very similar. 

Recall that @’ c A?, = Spec R[x’, 2’, y’] is given by the equations 


2 3 4 72 
Cl.’ +y’ = ny”, n=q'y’. 
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The Special Fiber $ of the Scheme $: X2+Y =T*, 7 =TY 
Figure 4.3 


To blow up @’ along the curve y’ = 0, we make the substitutions 
2 =XY, y=y, weT, 


and cancel Y? from the first equation. This yields the arithmetic sur- 
face 8 C A}, = Spec R[T, X,Y] given by the equations 


8:X°?4Y=T1, w=TY. 


Note that $8 is regular at X = Y = T = m7 = O, since the equations 
for $ show that the maximal ideal at that point is generated by the two 
variables X and T. One can similarly verify that & is regular at all other 
points, so it is a regular model for C. 

To compute the special fiber S of 8, we set = TY = 0. The part 
with T = 0 is the non-singular rational curve X? + Y = 0, whereas the 
part with Y = 0 factors as 


Maa (XP yx eT) =0, 


so it consists of two non-singular rational curves which intersect tangen- 
tially. All three of the components of § intersect at the point X = Y = 
T = 0, so the special fiber 8 looks as illustrated in Figure 4.3. 

Label the three components of 8 as indicated in Figure 4.3, 


Fi: Y=X-T?=0, Fo:Y=X4+T’?=0, Fy:T=X74+Y =0. 


Looking at Figure 4.3 or directly from the equations for the components, 
we can compute the pairwise intersections 


Fi-Fe=2, Fi -Fe=1, Fy Fy=1. 


Next, using the fact (7.3) that the intersection of a component with the 
entire fiber 8 = F, + F> + F3 is zero, we compute the self-intersections, 
F? = -(F,-Fo+ F,- Fs) = -3, 
FR = -(Fh- Fi, + Fy: F3) = -3, 
F3 = —(F3- F, + F3- Fo) = -2. 
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This shows that F,, F2, and F3 are not exceptional curves, so § is a minimal 
regular model. 
We have now computed the incidence matrix of the special fiber, 


FL: Fi F,- Fo F\- Fs -3 2 1 
Fo: Fi Fo: Fy Fo-F3 | = 2> ‘ade 
F3-F, F3-Fo F3- Fs 1 1 #2 


The incidence matrix of the special fiber of a minimal proper regular model 
of a curve can be used to compute the group of components of the Néron 
model of its Jacobian variety, see Raynaud [1] and exercises 4.32 and 4.33. 
In this example, the 2 x 2 minors of the incidence matrix have determi- 
nant 5. Raynaud’s theorem then implies that the group of components of 
the Néron model of the Jacobian variety is a cyclic group of order 5. 


88. The Special Fiber of a Néron Model 


In this section we are going to describe the Kodaira-Néron classification 
of special fibers on minimal proper regular models of elliptic curves. Our 
main tool will be the intersection theory described in the previous section. 
We will work over a discrete valuation ring with algebraically closed residue 
field. In the next section we will give an algorithm of Tate which computes 
the special fiber and also provides some additional information, including 
a description of what happens when the residue field is not algebraically 
closed. 

We begin with a proposition which describes the intersection properties 
of the components of the special fiber. The most important part of this 
proposition is the last formula in (d), since it is this formula which puts 
severe constraints on the possible configurations of the components. 


Proposition 8.1. Let R be a discrete valuation ring with maximal 
ideal p, fraction field K, and algebraically closed residue field k. Let E/K 
be an elliptic curve, and let C/R be a minimal proper regular model 
for E/K. Suppose that the special fiber of © contains r irreducible compo- 
nents, say F,,..., F, and write the special fiber as 


i=1 


(a) At least one of the n,’s is equal to 1. 
(b) Let Ke be a canonical divisor on © (7.4a). Then 


Ke: F =0 for all fibral divisors F € Div,(C). 
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(c) Ifr =1, then F? =0 and pa(Fi) = 1. 
(d) Suppose that r > 2. Then for each 1 <i <r, 


F? =-2, F,&Pi, and In ee ee 
1<j<r, j#i 


PRooF. The scheme € is proper over R, so C(R) & E(K) from (4.4a). By 
definition, an elliptic curve E/K has at least one K-rational point, namely 
its identity element, so we can find a point P € C(R). Let P(p) € Cp be 
the image of P on the special fiber of C, and let F; be a component of Cy 
containing P(p). The scheme @ is regular, so (4.3b) says that P(p) is a 
non-singular point of Cy. It follows that n; = 1, since if n; > 2, then every 
point of F; would be a singular point of C,. This completes the proof of (a). 

Next we consider the special fiber C, as a divisor on ©. It has the 
following three properties: 


C0 from (7.3b) 
Pa(Cp) = g(E) = 1 from (7.4b) 
C2 + Ke: Cy = 2pa(Cp) —2 adjunction formula (7.4a). 


Substituting the first two equations into the third gives 
Ke: Cp = 0. 


We next apply the adjunction formula (7.4a) to an irreducible fibral 
component F; to get 


F? + Ke: F, = 2pa(Fi) — 2. 


The arithmetic genus of an irreducible divisor is non-negative (7.4c), since 
by definition it is the dimension of a certain cohomology group. Thus the 
right-hand side of the adjunction formula is at least —2. On the other hand, 
we know that F? < 0 from (7.3b). This leads to the following possibilities: 


(i) F?=0 forsome1<i<r. 
(ii) F2<0O and Ke-F;<0 forsomel<i<r. 
(iii) F2 <0 and Ke-F,>0 foralll<i<r. 


In case (i) we know from (7.3b) that F; must be a multiple of the entire 
special fiber. More precisely, we must have r = 1 and C, = nF). Then (a) 
tells us that n,; = 1, so F; = Cy. Now the equality Ke-C, = 0 proven 
above gives us (b), and the facts CS = 0 and p,(Cp) = 1 noted above give 
us (c). This completes the proof of Proposition 8.1 in case (i). 
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Next consider case (ii). Each term in the left-hand side of the adjunc- 
tion formula is negative, whereas the right-hand side is at least —2. The 
only way for this to happen is if 


pe =Ke-F,=-1 and Pa(F;) = 0. 


Now (7.4c) says that F; & P!, and hence F; is an exceptional divisor. But 
Castelnuovo’s criterion (7.5b) says that a minimal proper regular model 
contains no exceptional divisors, so case (ii) cannot occur. 

It remains to consider case (iii). The strict inequality F? < 0 implies 
in particular that r > 2. We take the equality Ke-C, = 0 proven above 
and write it out in terms of the fibral components as 


t=21 


Each n; > 1, and since we are in case (iii), each Ke - F; > 0, so the only 
way that this can be true is if we have 


Ke: F; =0 for alll <i<r. 


This proves (b). 
Next we substitute Ke- F; = 0 into the adjunction formula for F;, 
which yields 
F? = 2p.(Fi) — 2. 


We are in case (iii), so F? <0, whereas p.(F;) > 0 from (7.4c). It follows 
that 
F? =-2 and Pal F;) = 90, 


7 


and then (7.4c) tells us that F; ~ Pj}. Finally, we note that C,-F; = 
F,-C@, = 0 from (7.3b), which allows us to compute 


(S64 we Sas Ss; nF; + Fj. 


j=1 1<9Sr, G#t 
This completes the proof of (d). ob 


We are now going to use (8.1) and a combinatorial argument to give 
the Kodaira-Néron classification of the fibers of minimal proper regular 
model of elliptic curves. 


Theorem 8.2. (Kodaira [1], Néron [1]) Let R be a discrete valuation 
ring with maximal ideal p, fraction field K, and algebraically closed residue 
field k. Let E/K be an elliptic curve, and let C/R be a minimal proper 
regular model for E/K. Then the special fiber Cy of © has one of the 
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following forms. (See Figure 4.4. Note that the small numbers in Figure 4.4 

indicate the multiplicities of the components.) 

Type Ip. Cy is a non-singular curve of genus 1. 

Type l;.  @, is a rational curve with a node. 

TypelIn. Cy, consists of n non-singular rational curves arranged in the 
shape of an n-gon, where n > 2. 

Type II.  €, is a rational curve with a cusp. 

Type III. @, consists of two non-singular rational curves which intersect 
tangentially at a single point. 

Type IV. ©, consists of three non-singular rational curves intersecting at 
a single point. 

Type Ij. Cy» is a non-singular rational curve of multiplicity 2 with four 
non-singular rational curves of multiplicity 1 attached. 

Type I}. Cy» consists of a chain of n+ 1 non-singular rational curves of 
multiplicity 2, with two non-singular rational curves of multi- 
plicity 1 attached at either end. 

Type IV". @, consists of seven non-singular rational curves arranged as 
pictured in Figure 4.4. 

Type III". @, consists of eight non-singular rational curves arranged as 
pictured in Figure 4.4. 

Type II*. (, consists of nine non-singular rational curves arranged as 
pictured in Figure 4.4. 


Remark 8.2.1. If the residue field k of R is not algebraically closed, 
then €, may have some components which are irreducible over k but be- 
come reducible over a finite extension of k. In other words, the Galois 
group Gx, may act non-trivially on the k-irreducible components of Cy, 
and then the k-irreducible components of C, are the orbits. We will discuss 
this situation further in the next section when we describe Tate’s algorithm. 


Remark 8.2.2. The dual graphs of the pictures in Figure 4.4 turn out 
to be extended Dynkin diagrams. There is a discussion of this in Mi- 
randa (1, I §6], as well as a proof of (8.2) based on the negative semi-definite 
quadratic forms attached to the extended Dynkin diagrams. The proof re- 
lies only on the facts proven in (7.3), namely that C, is connected and that 
the intersection pairing on Cy is negative semi-definite with kernel equal to 
the entire fiber. 


Remark 8.2.3. We will use Kodaira’s [1] notation I, II, ..., II*, I* 
to describe the various types of special fibers (8.1). There is a second 
notational system, due to Néron [1], which is also in common use. For the 
convenience of the reader, we briefly list the equivalences. 


poise In wiyol tr 
Pro Pe Le s[etele 
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Reduction | Number of Configuration 
Type Components (with multiplicity) 


The Kodaira-Néron Classification of Special Fibers 
Figure 4.4 
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Remark 8.2.4. Ogg [3] and Namikawa and Ueno [1] have given a classi- 
fication, similar to (8.2), for the special fibers of proper regular minimal 
models of curves of genus 2. It turns out that there are more than 100 
configurations! 


PROOF (of Theorem 8.2). We will write the special fiber of C, as usual as 


cM 
i=1 
We are going to consider a number of cases, which we will box for clarity. 


C, has r = 1 component 


Proposition 8.1(a,c) tells us that n; = 1 and pg(C,) = 1, so Cy = F{ is an 
irreducible curve of arithmetic genus 1. If C, is non-singular, then it is a 
non-singular curve of genus 1 (Hartshorne {1, IV.1.1]), so we have Type Ip. 
If C, is singular, then Hartshorne [1, V.3.7] and the fact that pa(C,) = 1 
means that a single blow-up of a singular point on @, will produce a non- 
singular rational curve. Hence @, is a rational curve with exactly one 
singular point of multiplicity 2, from which it is not hard to show that the 
singular point is either an ordinary node or an ordinary cusp. This gives 
Types I, and II. 

(Alternative proof for r = 1. We will see in the next section that 
if r = 1, then the scheme W/R defined by a minimal Weierstrass equation 
for EF is already a regular scheme. It will follow that C = W, so the 
special fiber C, is obtained by reducing the minimal Weierstrass equation 
modulo p. But we already know from [AEC, VII §5] that the reduction W, 
is either a non-singular curve of genus 1, a rational curve with a node, or 
a rational curve with a cusp.) 

We assume henceforth that r > 2, which means that we can apply the 
formula given in (8.1d), 


>> Fy Fy = 2n. (x) 


I<i<r, j#% 


Note that each n; > 1, so every term in the sum is non-negative. We 
will be making frequent use of this important formula (*). We also note 
from (8.1d) that every component is a non-singular rational curve; that is, 
F, = Pi for every 1 <i<r. 

Proposition 8.1(a) says that one of the n,’s equals 1, so relabeling 
the F;,’s if necessary, we may assume that n, = 1. We further know that Cy 
is connected (7.3a), and r > 2 by assumption, so after further relabeling 
we may also assume that F|- Fo > 1. 


€, has r = 2 components 


This means that C, = F\ + n2Fo, so applying (*) for i = 1 and i = 2 gives 
noFy- F, = 2 and Fy. Fy = 2no. 
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Further, Fy, - Fy = F)- F, from (7.2iii), so we deduce that 

Fy, > Fy =2 and ng =1. 
This means either that F and F> intersect tangentially in a single point, 
which gives Type III, or else they intersect transversally at two distinct 


points, which gives Type Ig. This completes the analysis of the special 
fiber in the case that Cy, has exactly two components. 


C, has r > 3 components 


We claim in this case that intersecting components always intersect trans- 
versally; that is, we claim that 


Fp Fy <1 forall 1 <i,’ <r,iFi’. 


To see this, we use the fact that C, is connected and contains at least three 
components to find a third component, say F;,, so that fF, intersects at 
least one of F; or Fy, say Fy - F; > 1. Applying (*) to F; and to Fj gives 
the inequalities 


ny Py Fy < ny Fy + Fy t+ ng Fy: F< 2n; and nj; F;- Fy < 2nj;. 
We now multiply these two estimates to obtain the strict inequality 
NN; (F; : Fy)? < Anjnj. 


Therefore F; - Fj, < 2, and since the intersection index is an integer, we 
find that F; - Fj <1 as desired. 

In particular, we have F\-F) = 1. Using this and the fact that n, = 1, 
we can apply («) to F, to obtain the bound 


Ng = NOP: Fo < 2ny = 2. 


Thus nz equals either 1 or 2, which leads to two further subcases. 


Applying (*) to F2 gives 


Tr rr 
ny Fy Fo +S njF; » Fy =2n2, and hence Soe -Fo=1, 
j=3 j=3 


since F,:-F> = 1 and n, = ng = 1. This means that there is exactly one more 
component intersecting Fo, call it F3, and ng = 1. If F3 also intersects F), 
then we get two possible configurations depending on whether or not Fi NF3 
is the same point as F2F3. If they are the same point, then we get a fiber 
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of Type IV, and if they are not, then we get three rational curves arranged 
in a triangle, which is Type Is. 

Suppose now that F3 does not intersect F;. Then applying (*) to F3 
gives 


nF: F3+ Sak -F3 =2n3, and hence S05 F; - Fz = 1, 


gad j=4 


since Fy - Fy = 1 and ng = n3 = 1. Therefore there is exactly one more 
component intersecting F3, call it Fy, and n4 = 1. If F4 also intersects F}, 
then we have four rational curves arranged in a square, which is Type J4. 
If Fy does not intersect F\, then applying (*) to Fy gives in the same way 
one more component Fx intersecting Fy, and n; = 1. The fiber C, has only 
finitely many components, so this process must eventually terminate. More 
precisely, since C, has r components, the process will terminate with F;, 
intersecting F,. At this point we will have r rational curves, each of multi- 
plicity 1, arranged in the shape of a polygon, which means that the fiber C, 
is of Type I... 


Applying (*) to F\ gives 


noky- Fy + So 05 F; -F, =2n,, and hence So nj F; -F, =0, 
j=3 j=3 


since F, - Fp = 1, ny = 1, and ng = 2. This means that there are no more 
components intersecting F;. Next applying (*) to Fo gives 


mFy- Fy + > 5 F; - Fy =2n2, and hence So nj Fj Fe = 3. 
j=3 j=3 


Thus F> intersects either one, two, or three additional components. 

Suppose first that Fp intersects three additional components, which 
we label F3, Fy, and Fs. Then n3 = ng = ns = 1, which gives a fiber of 
Type Ij, and applying (*) to F3, Fy, and Fs; shows that C, contains no 
other components. 

Next suppose that F 2 intersects exactly two other components, say F3 
and Fy. Switching these two components if necessary, we have n3 = 1 
and n4 = 2. Thus C, contains the configuration illustrated in Figure 4.5(a), 
where the small 2’s next to Fy and FY indicate that they are components 
of multiplicity 2. Of course, C, may contain some additional components. 
Let s => 4 be the largest integer so that C, contains the configuration 
illustrated in Figure 4.5(b). Applying (*) to F; for any 4 <i < 5s gives 


Tr T 
ny—1Fi-1-Fy+nigi Fig. Fit >. nF; -F; =2n;, so > nj Fy Bj = 0, 
J=stl j=stl 


308 IV. The Néron Model 


(a) (b) 
Building a Fiber of Type I* 
Figure 4.5 


since Fy_, - Fi = Fig. > Fy = 1 and nj) = n; = ni41 = 2. Thus there are 
no more components intersecting any of F4,...,F';~1. On the other hand, 
applying (*) to F, gives 


Tr Tr 
ns—1Fs-1°: Fo + oS nj Fj; =2ns, and hence x nj Fj - Fs = 2. 
j=stl j=stl 


If F, intersects exactly one additional component, then that compo- 
nent will have multiplicity 2, which means that C, contains the config- 
uration in Figure 4.5(b) with one more multiplicity-2 component. This 
contradicts our choice of s, so Fz must intersect two additional compo- 
nents, each of which has multiplicity 1. This gives us a fiber of Type I* 
(with n = s — 3), and it is then easy to check using (*) that there are no 
more components. This completes the proof in the case that F> intersects 
exactly two components in addition to Fy. 

Finally we suppose that F» intersects exactly one additional compo- 
nent, say F3, with multiplicity ng = 3. This means that C, contains the 
configuration illustrated in Figure 4.6(a). Let t > 3 be the largest integer 
so that C, contains the configuration illustrated in Figure 4.6(b). Apply- 
ing («) to F; for any 3 <7 < t gives 


‘fl - 
taihisi hei ie nj FjFi = 2ni, so s? nj fy Fy = 0, 
jat+1 j=t+l 


since 
Fy_4 .#F; = Fy4i- F; = 1, M1 =i-1, Nr; =1, and N41 =i+l. 


Thus there are no more components intersecting any of F3,..., F;-1. On 
the other hand, applying (*) to F; gives 


r 


- 
nei F_-1- Fy t+ > nj Pj Fy = 2nt, and hence > njFy- PF, =t+1. 
j=t+l jettl 
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F, 1 F3 3 
2 
Fo 
(a) 
F,]1 F313 
1 > 3 
Fy 
F,]1 F3]3 
1 > 3 
Fy 
(d) 
Building Fibers of Type IV*, III*, II* 
Figure 4.6 


If F, were to intersect exactly one more component, then that component 
would have multiplicity ¢ + 1, contradicting the fact that we chose the 
largest ¢ so that C, contains the configuration in Figure 4.6(b). Thus F; 
intersects at least two additional components, say Fy4) and Fy49. 

Let F; be any component intersecting F;. Applying («) to F; gives 


mi, Fi <2n;, and hence n,; > t/2. 


It follows that F; intersects only the two additional components Fy+1 
and Fi,2, and we have the estimates 


M41 + N42 =t+1, N41 > t/2, Ne+2 = t/2. 


Switching F;,; and Fi. if necessary, we may assume that ny, < neyo. 
Then there are only two possibilities, depending on the parity of t. 


t = 1 (mod 2) 
In this case we must have 


t+1 
N41 = N42 >= or 
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Applying (*) to Fi41 then gives 


, r 
nel Pega t+ ) nj Fy Fiat = 2nt41; and hence > nj, F; Fray = 1, 
j=t+3 j=t+3 


since Fy: Fi4, = 1, my =t, and ne41 = (t +1)/2. Therefore F,,1 intersects 
exactly one more component, say F343, with multiplicity nu43 = 1. Ap- 
plying the exact same argument to Fi+i2 gives the same conclusion, so Cy 
contains the configuration illustrated in Figure 4.6(c). 

Finally, applying (*) to Fii3 gives 


Tr ka 
3-¢t 
nea: Pagt De nj Fj Fis3 = 2nti43, so ye nF; Fiy3 = —., 


2 
j=tt5 j=tts 


since Fi4i- Fiy3 = 1, neg, = (+ 1)/2, and n43 = 1. The sum is non- 
negative, and t > 3 by assumption, so we must have t = 3. Therefore C, 
looks like Figure 4.6(c) with t = 3, which is precisely Type IV*. This 
completes the proof when t is odd. 


t = 0 (mod 2) 


In this case we must have 


t ead t+2 
n — nN = —. 
t+1 5) t+2 2 


Applying («) to Fi41 gives 
a ei 
mF Fea t+ S> nj PF: Fi41 = 2nt41, and hence S> nj Fy Fy. = 9, 
j=tt3 j=t+3 


since Fy - Fi4, = 1, m% = t, and n441 = t/2. Thus there are no additional 
components intersecting Fi+1. 
Next we apply («) to Fi,2. This gives 


¥- Tr 
mF Figat SY) nj Fj-Fiz2 =2my2, and hence S° njFj-Fiy2 = 2, 
j=tt3 j=tt3 


since Fy: Figg = 1, me = t, and neyo = (t + 2)/2. Hence Fi+2 intersects 
at least one additional component, say F143, whose multiplicity n¢+3 is 
either 1 or 2. So we now know that €, contains the configuration illustrated 
in Figure 4.6(d). 

Don’t despair, we’re almost done! Applying (*) to Fiis3 gives 


Tr 
Nep2F 412+ Frag + y nj Pj Frys = 2ne43, 
j=tt+4 
and hence 
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T 
t+2 (2—t)/2 if ne+3 = 1, 
S> nj Fj + Fits = 2nt43 — as Taos eee if N43 = 2, 
jat+4 
since Fiio- Fisg = 1 and nizg = (t + 2)/2. But the sum is non-negative, 
and t is even and > 3 by assumption, so we must have 


t=4or6 and nt43 = 2. 


If t = 6, then there are no additional components, and Figure 4.6(d) is 
exactly Type IJ*. Finally, if t = 4, then there is one more component F144 
hitting Fy.3, and its multiplicity is nz,4 = 1. This gives Type II*, which 
completes the proof of Theorem 8.2. Oo 
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In this section we are going to describe an algorithm of Tate which com- 
putes, among other things, the reduction type of an elliptic curve given by 
a Weierstrass equation. We set the following notation, which will be used 
throughout this section. 
R a discrete valuation ring with maximal ideal p, uniformizing 
element 7, fraction field K, perfect residue field k of character- 
istic p, and normalized valuation v. 


E/K an elliptic curve given by a Weierstrass equation 
E: y? + ary + ay = x? + ax? + age + ag. 


C/R — aminimal proper regular model of E over R (4.5b). 


€/R the largest subscheme of C/R which is smooth over R (6.1.1). 
Note that €/R is a Néron model for E/K (6.1). 


C/k = € xk, the special fiber of C. 

&/k = €xRrk, the special fiber of €. It is a group variety over k. 

€°/R_ the identity component of € (6.1.2); that is, €° is the open 
subset of € obtained by discarding the non-identity components 
of the special fiber. It is a subgroup scheme of € (exercise 4.25). 


Ee /k the identity component of the group variety €/k (1.5c). 


Tate’s algorithm is essentially a set of instructions for computing C 
and € from a given Weierstrass equation. For this reason its statement 
as a formal theorem has the unsatisfying form: “The following 11 step 
procedure leads to the stated results.” So before we describe the algorithm 
itself, we want to give two corollaries. This will serve to explain (if not 
to excuse) why the following results are called “corollaries,” when in fact 
they are really conclusions which can be deduced from the description and 
validity of Tate’s algorithm. 
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Corollary 9.1. Take a minimal Weierstrass equation for E'/K, and 
let W C P% be the closed subscheme defined by this equation. Fur- 
ther, let W°/R be the largest subscheme of W which is smooth over R. 
Then W° = €°; that is, W° is the identity component of a Néron model 
for E/K. 


Corollary 9.2. Take a minimal Weierstrass equation for E/K. We recall 
from [AEC, VII §2] the following notation: 
EB /k the reduction of the given Weierstrass equation modulo p. 
Ens(k) the set of non-singular points of E(k). 
Ej(K) ={PeEE(K): Pe Etkyne the set of points of E(K) with 
non-singular reduction. 
E\(K) ={Peé E(K): P= O}, the set of points of E(k) which 
reduce to the identity element. 
Further, let 
El(R) ={o0€ E(R): o(p) =O€ E(k)}. 
The isomorphism E(k) & €(R) described in (5.1.3) induces the following 
identifications: 


(a) 
E(K) > Eo(K) > E,(K) 


I I I 
CRY Se BOG) SS celCR) 


(b) 
E(K)/Ey(K) & €(R)/E°R) —> E(k)/E%Ck) 


l I l 


E(K)/E\(K) E(R)/E(R) — — E(k). 


Ile 


If K is complete, or even merely Henselian, then both inclusions in (b) are 
isomorphisms. 

(c) Ens(k) = €°(k). 

(d) The group E(K)/Eo(K) is finite. More precisely, if FE has split multi- 
plicative reduction, then E(K)/Eo(K) is a cyclic group of order —v(j(E)); 
otherwise, E(K)/Eo(K) has order 1, 2, 3, or 4. 


Remark 9.2.1. Note that Corollary 9.2(d) is exactly [AEC, VII.6.1], a 
result which was left unproven in [AEC]. The fact that E(K’)/Eo(K) is 
finite, even if the residue field k is infinite, played an important role in the 
proof of the criterion of Néron-Ogg-Shafarevich [AEC, VII.7.1], and we will 
use it again in the next section (10.2) when we prove a generalization of 
this criterion. 
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Remark 9.2.2. Corollary 9.2 can be used to bound the torsion subgroup 
of an elliptic curve defined over a local field or a number field. Con- 
tinuing with the notation set at the beginning of this section, we recall 
from [AEC, VII.3.1(a)] that the subgroup E£)(A) contains no prime-to-p 
torsion, since it is isomorphic to the formal group of &. Suppose now 
that E has additive reduction. Then €°(k) = Eys(k) = k* is a p-group 
from (9.2c), whereas the quotient E(K)/Eo(K) = €(k)/€°(k) is a group of 
order 1, 2, 3, or 4 from (9.2d). Using these facts and the exact sequences 


Oh 2. BECO eae) es £%(k), 
0 — E(K) — E(K) — E(K)/E(K) — 0, 


we conclude that if E/K has additive reduction, then E()tors has or- 
der ap® for some a € {1,2,3,4} and some e > 0. 

Now consider an elliptic curve & defined over a number field K. The 
torsion subgroup of E(A’) injects into the torsion subgroup of E(/,) for 
each completion of K, so the local estimate we just proved can often be used 
to obtain strong global estimates. For example, suppose that E has additive 
reduction at primes p,, p2 of AK with distinct residue characteristics pj, po. 
Then E(K)tors has order dividing 12, and if p; > 5, then E(K)tors has 
order at most 4. 


Tate’s algorithm, which we are now going to describe, computes the 
following quantities associated to the elliptic curve E/K: 


Type the reduction type of the special fiber over the algebraic clo- 
sure k of k. We will use the Kodaira symbols (8.2) to describe 
the reduction type. 


m(E/K) the number of components, defined over k and counted without 
multiplicity, on the special fiber C. 


u(De/K) the valuation of the minimal discriminant of E/K. 


f(E/) the exponent of the conductor of E/K. This quantity will be 
defined in $10, but for now we note that it can be computed 
using Ogg’s formula (11.1), 


f(E/K) = v(Dp/x) — m(E/K) +1. 


c(E/K) the order of the group of components &(k)/E°(k). Equiva- 
lently, c(E/K) is the number of components of the special 
fiber C which have multiplicity 1 and are defined over k. 
To ease notation, we will sometimes write 


m=mE/K), f= f(E/K), c= c(E/k), 


when the curve & and field K are clear from the context. 
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Remark 9.3. If K is a complete local field, then (9.2b) says that c(E/K) 
equals the order of E(K)/Eo(K). This quantity is the so-called Birch- 
Swinnerton-Dyer “fudge factor.” It is a sort of p-adic period and appears 


in the conjectural formula for the leading coefficient of L(E,s) around 
s = 1 [AEC, C.16.5]. See Tate [2, §5] for details. 


Tate’s Algorithm 9.4. (Tate [2|) The following algorithm computes the 
reduction type, the values of m(E/K), u(Dryx). f(E/K), and c(E/K), 
and the various other quantities described during the course of the algo- 
rithm. 


Remark 9.4.1. At the conclusion of Tate’s algorithm, one obtains a min- 
imal Weierstrass equation for the given elliptic curve. In practice, however, 
it is considerably easier to implement Tate’s algorithm if one knows, a pri- 
ori, that the initial Weierstrass equation is minimal. Further, if one only 
wants a minimal equation and is not interested in computing other quan- 
tities, such as the reduction type, then there are easier methods available. 
For example, if the characteristic p of k satisfies p > 5, then a given Weier- 
strass equation is minimal if and only if either u(c4) < 4 or v(cg) < 6. 
In general, one can use a short algorithm of Laska [1] to find a minimal 
Weierstrass equation. See also exercise 4.36 for another method. 


Remark 9.4.2. In the case that the residue field k is algebraically closed, 
we have assembled information about the various reduction types in Ta- 
ble 4.1. This table is taken, with minor modifications, from Tate [2, §6]. 
Notice that if char(k) A 2,3, then everything about EF (reduction type, 
exponent of conductor, group of components E(/)/E (4)) can be read 
off from Table 4.1 once one has a minimal Weierstrass equation for E/K. 


Our description of Tate’s algorithm follows very closely Tate’s exposi- 
tion [2]. The idea is to begin with an arbitrary Weierstrass equation 


E:y? +ayry + a3y = 2? + apr? + aga + a6 


for E/K and manipulate it to produce a minimal proper regular model €. 
Once we have this model, we will be able to read off all of the information 
we want. As we go along we will be making various assumptions. These 
assumptions are cumulative, and will be [boxed] for clarity. We will 
delay the proofs of the various steps until after describing the complete 
algorithm. 

Making a change of variables, we may assume that the Weierstrass 
equation has coefficients fax, G2,43,44,46 € RI. We let 


bg =a? + dao, ba = a)4a3 + 2a4, bg = a3 + dag, 
bg = ajag + dagag — a14304 + a203 — a4 = (bebe — 64)/4, 
A = —b3bg — 8b} — 272 + 9bobsbe 
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be the usual quantities [AEC, III §1] associated to the given Weierstrass 
equation. 

It is not necessary to assume that the original Weierstrass equation is 
minimal. When the algorithm terminates, the resulting Weierstrass equa- 
tion will be minimal, so its discriminant will equal v(Dz¢/x). Further, we 
will see that the smooth part of the final Weierstrass equation, considered 
as a scheme over R, is the identity component of C, which will prove (9.1). 


Step 1. Ifa { A, then the special fiber é/k is an elliptic curve, and we 
have 
Typelh, v(A)=0, m=1, f=0, c=1. 


Step 2. Assume [|A]. This means that E has a singular point. 
Make a change of variables to move the singular point to (0,0). Then 
[7laz, a4 and ag]. If x ¢ bo, then we have Type I,, with n = v(A). More pre- 
cisely, let k’ be the splitting field over k of the polynomial T? + a,T — az. 
Then we have 


Typel,, v(A)=n2>1, mean, f=l. 
Farther, if k’ =k, then E has split multiplicative reduction, 
€% (k) &k* and c=n; 
whereas if k’ #4 k, then F has non-split multiplicative reduction, 


1 if n is odd, 
2 ifn is even. 


From now on, £ has a cusp and |€°(k) & kt]. We are going transform 


the Weierstrass equation so as to make the a;’s more and more divisible 
by 7. To keep track, we introduce the convenient notation 


&%k) &{aek’: NE (a)=1} and c= { 


Air = 7 ' Aj. 


Step 3. Assume now that [7]b2].- If 7? { ag, then 
Type, m=1, f=v(A), c=. 
Step 4. Assume that (which implies that 1?|bg and 7?|bg). If 
mw? t bg, then 
Type ll], m=2, f=v(A)-1, c=2. 
Step 5. Assume that (which implies that 7?|b4). If 73 { bg, then 
Type IV. Let k’ be the splitting field over k of T? + a3.1T — ag2 = 0. Then 


3 if k’ =k, 


Type IV, m=3, f =v(A) -2, = if k’ Ak. 
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Step 6. Assume that ae Then we can change coordinates to get 


nla, and aj, 7/a3 and ay, and 7 lag]. 


More precisely, the boxed assumptions up to this point show that we can 
factor 


Y?+a,Y — a2 =(Y —a)? (mod zr), 
Y? +.a31Y —a62 =(Y — 8)? (mod nr), 


and then the substitution y’ = y + ax + Br will have the desired effect. 
Having done this, we consider the factorization over k of the polynomial 


P(T) = T? + a21T? + aa2T + 06,3. 
To assist in explicit computations, we note that P has discriminant 
Disc(P) = 1~°(—4aag + a3a2 — 4a} — 2702 + 18a2a4a¢). 
If P(T) has distinct roots in k (i.e., if 7 { Disc(P)), then 
Typel§, m=5, f=v(A)—4, c=1+#{aek: P(a) =0}. 
Step 7. If P(T) has one simple root and one double root in k, then 
Type I*, m=n+5, f=v(A)-4-n, c=2o0r4. 


If p £2, then n = v(A) — 6, so m = v(A) — 1 and f = 2. For arbitrary p, 
one can calculate the values of n and c using the following subprocedure 
to Step 7. 


Step 7. (Subprocedure) Translate x so that the double root of P(T) is T = 
0. Then 1? f aa, m a4, and r|ag. If the polynomial Y? + a3,2Y — a¢,4 has 


distinct roots in k, let k’ be its splitting field. Then 


, _ 7 7 _J4 ifk’=k, 
Type lt, m=6, f=uld)-5, e={5 if k £k. 


If Y? +.a3,2Y —a¢,4 has a double root in k, translate y so that the root 
is Y =0. Then 7?|ag3 and 7° lag. If the polynomial a2,,X? + a43X + a6,5 
has distinct roots in k, let k’ be its splitting field. Then 


4 if k’ =k, 
2 if k’ #k. 


Type ls, m=7, f =v(A)—6, e=4 
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If ag, X74 a4,3X + 46,5 has a double root in k, translate x so that the 
root is X = 0. Then mw laq and 7®lag. If the polynomial Y? + a33Y — 266 
has distinct roots in k, let k’ be its splitting field. Then 


Regge. 
Type I3, m=8 f=v(A)-7, c= . ae 

If Y? +a33Y — ag,¢ has a double root in k, etc. Continue this proce- 
dure until the quadratic polynomial which appears has distinct roots in k. 
The process will terminate because after each two steps we have will have 
forced a3, a4, and ag to each be divisible by at least one additional power 
of 7. This means that b4, bg, and bg are also divisible by at least. one addi- 
tional power of 7, and hence the same is true of A. But the discriminant A 
is invariant under all of the translations involved, so the process will stop. 
Step 8. Suppose now that P(T’) has a triple root in k. Making a trans- 
lation on x, we may assume that the root is T = 0, which means that 
[n?|a2, m?\a4, and 74\ag|. If the polynomial Y? + a3 2Y — a¢.4 has distinct 
roots in k, let k’ be its splitting field. Then 


3 ifk’ =k, 


Type IV*, m=7, f=v(A)-6, o= {3 ifk’ £k. 


Step 9. Suppose now that Y?-+a3 2Y —ae.4 has a double root in k. Making 
a translation on y, we may assume that the root is Y = 0, which means 
shat: (las onda lac). Tha! hau, tlien 


Type lll*, m=8, f=v(A)-7, c=2. 


Step 10. Suppose that [74 ]a4]. If 7° { ag, then 


Type ll*, m=9, f=v(A)-8, c=1. 


Step 11. Finally, suppose that ae Then the original Weierstrass 
equation was not minimal. The substitution (x,y) = (1?2’, 73y’) leads to 
the equation 


72 toe , 13 72 ! 
Y +a1vy +43 3Y =X +4220 +4444 + 466 


with coefficients in R and discriminant A’ = 77!*A. Go back to Step 1 
and begin the algorithm again with this new equation. Note that we can 
only get to Step 11 a finite number of times, since each time we get here, 
the discriminant of the original Weierstrass equation must be divisible by 
an additional factor of 71°. Therefore the algorithm will terminate. 
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This concludes our description of Tate’s algorithm. We are now going 
to give some indication of why the various steps in Tate’s algorithm yield 
the stated conclusions. The idea is to start with the given Weierstrass 
equation and perform a sequence of blow-ups to produce a minimal regular 
model for E. In practice, we will really only need to carry out the blowing- 
up process until we are able to recognize which type of fiber is emerging. 

Let W c P% be the scheme defined by the given Weierstrass equation, 
and let W°/R be the largest subscheme of W that is smooth over R. In 
other words, W® is formed by removing from W all singular points (if any) 
on its special fiber W. Just as in the description of Tate’s algorithm, we 
will put a box around cumulative assumptions as we make them. 


Proof of Step 1. The condition a { A means that the special fiber W is 
non-singular, so W itself is smooth over R. Hence € = C = W, which shows 
that the special fiber is of Type Io. 

We assume now that [ld], which means that the reduction E has a 


singular point. (Equivalently, the special fiber W is singular.) Making a 
linear change of variables, we may assume that the singular point is (0, O)eE 
E. This means that if we write 


flay) =y? tary + azy x agr a4v — ae, 


then f(0,0) = 0 (mod 7), and further both partial derivatives (Of /Ox) (0,0) 
and (Of /Oy)(0,0) vanish modulo 7. Hence [wla3, a4, a6]. 


Proof of Step 2. This is the case that EF has multiplicative reduction. 
We are going to leave it to the reader (exercise 4.37) to perform the blow- 
ups necessary to resolve the singularity in this case. At the end of this 
section (9.6) we will briefly explain another approach to analyzing mul- 
tiplicative reduction using Tate’s analytic models for elliptic curves over 
complete local fields. We will also prove the following lemma which covers 
Types I, and II. 


Lemma 9.5. Let R be a discrete valuation ring with fraction field K, 
let E/K be an elliptic curve given by a Weierstrass equation 


y? +ayry+azy = x + aor" + a4xu + ag 


with coefficients in R, let W C P%, be the R-scheme defined by this equation, 
and let W°/R be the largest subscheme of W that is smooth over R. 

(a) Ifv(A) = 1, then W is regular, C = W, and € = W®. The curve E has 
Type I, reduction. 

(b) If rla3,a4,ag and 7 | ag, then W is regular, C = W, and € = W°. The 
curve £ has Type I, reduction if 7 { bo, and Type II reduction if r|bo. 


PROOF. (a) As described above, the fact that 7|A means that we can make 
a linear change of variables to get m|a3,@4,a6. This implies that m|b1, bg, bg. 
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If we make the assumption that v(A) = 1, then we must have v(bg) = 1, 
since all of the other terms in the formula for A are divisible by at least 7. 
Writing bg in the form 


bg = bza6 — a1a304 + a2a3 — at = beag (mod mn), 


we find that if v(A) = 1, then v(ag) = 1 and 7 f bo. 

We now drop the assumption that v(A) = 1 and prove that W is 
regular assuming only that \a3,@4,a¢ and 7? { ag. This will verify the 
first statements in both (a) and (b). We need to prove that W is regular 
at the singular point (0,0) € W on its special fiber. In other words, if we 
let m = (7,2, y) be the maximal ideal corresponding to the singular point 
on the special fiber, then we must show that the local ring of W at m, 


= Riz, y|m 
(y? + aizy + agy ~— 23 — agx? — aye — ag)’ 


Ow.m 


is a regular local ring. By assumption, v(ag) = 1, so ag is a uniformizer 
for R. On the other hand, ag is in the ideal of Ow,m generated by x and y, 
since 


ag = y? +a xy + agy — 2° — az” — age € LOW mt YOw mn 


Therefore the maximal ideal (7,2, y) of Ow,m is generated by the two el- 
ements x and y, so Ow.m is a regular local ring. This proves that W is 
regular, and since it is clearly also proper over R, we find that C = W 
and € = W°. This proves the first part of (a) and (b). 

The special fiber @ = Wis the curve 


y” + ary = x? + Gox? 


in A?. It will have a node (respectively cusp) at (0,0) if the quadratic 
form y? + Gry — Ggx? has distinct roots (respectively a double root) in k. 
The discriminant of this quadratic form is 47 + 4@2 = bs, so W has a node 
if 7 { bg and a cusp if w/bo. By definition, the special fiber is of Type I, 
if it has a node, and of Type II if it has a cusp, and we saw above that 
if v(A) = 1, then 7 { 62. This completes the proof of (9.5). o 


Continuing on past Step 2, we now assume that [7|b2]. Notice that 
bg is the discriminant of the quadratic form y? + a,ry — agx?, so this form 
has a double root in k, say 


y? + a,xry — ag? = (y—axr)? (mod zr). 


The substitution y — y + az allows us to assume that lax, a2]. Notice 
that this substitution leaves the other a;’s and all of the 6;’s unchanged. 
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Proof of Step 3. We are given that m|a3, a4, a6, bz and that 7? { ag. This 
is exactly the situation in (9.5b), which shows that the special fiber is of 
Type II. 


Proof of Steps 4 and 5. We now add the assumption that [7 [a6], and 
recall that our model satisfies 7|a1,a2,@3,a4. We are going to blow up the 
singular point 7 = x = y = 0 of W using the procedure described in (7.7). 
Thus the blow-up consists of the following three coordinate charts glued 
together in an appropriate fashion: 


2 eee. 2 
Wii yp + a1at1y + a31y1 = WL] + 2x] + G42) + 46,2, 


/ ! 4 12 rot 72 nor 
W :1+aia’ +a3.2 =a" y' +02" +a417'c' Fagor", Wy =7, 


u" 2 " now ” ” 2 wow 
Wy +ay +a3in yo =X +a9+a4i1T +ag27 °, TL =T. 


Note that we are using the notation a;,, = m~"a; introduced earlier. 

Looking at the special fibers of each of W,, W’, and W”, it is easy to 
verify that the special fiber of W” contains all of the components and all of 
the singular points of the special fiber of the blow-up. Thus all of the action 
will be happening on W”. Further, it is not hard to see that the projec- 
tion A}, + Aj}, induced by the natural inclusion R[y”, 7] — R[x", y”,7”] 
maps W” isomorphically to the subscheme of AZ = Spec R[y”,7”] given 
by the single equation 


2 oy now Wen ” 2 3 
YoU +7Q,1yY 7 +a31y TH =TA+74217 +4417 +4627. 


(The map in the other direction is 2” > y”* + ayy” + agim"y” — ag — 
a447" = agon”””.) 
We next take the closure of this scheme in Pees This means we homog- 
enize 
y =Y/X and HO ZX, 


which yields the scheme V Cc P?, = Proj R[X,Y, Z] given by the equation 
Vi Y¥?Z + ma XYZ +031:YZ? = 7X3 4 mag. X7Z + ag XZ? +06 22°. 


Notice that V is a model for E/K, since its generic fiber is isomorphic 
over i to the original Weierstrass equation defining E. 

To find the special fiber of V, we set 7 = 0, so V is the curve in A? 
given by the equation 


Vo? bagi VZ Gi XZ ~ae977)Z =O: 
Thus the special fiber consists of the line Z = 0 and the (possibly degen- 


erate) conic 
Y? + @31VZ — GgoZ7 = G4 XZ. (x) 
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This line and conic intersect at the point Y = Z = 0 with multiplicity 2. 
Suppose first that 7? { bg, which is the condition for Step 4. The 
formula defining bg is 


2 
bg = ajag + 4a2a6 — a1a3a4 + aza3 - ai, 


and our cumulative assumptions imply that every term except the last one 
is divisible by 7°. Hence 7° + bg if and only if 7? { a4, and this in turn 
is equivalent to the assertion that (*) is a non-singular conic (as opposed 
to being two lines). So 1? | bg implies that V consists of two non-singular 
rational curves intersecting at a single point with multiplicity 2, which is 
exactly Type III. 


Next assume that [73 bs], or equivalently that 7?|a,. This means that 
the special fiber V is given by the equation 


(Y? +43: YZ —G62Z7)Z =0, (+x) 


so over k it consists of three lines. These lines will be distinct if and only 
if the quadratic form Y? + @31YZ — G@g2Z? has distinct roots, which is 
equivalent to the condition that its discriminant az, 1 + 446.2 = = be, 2 does 
not vanish. So if we assume that 7? { bg, which is exactly the condition for 
Step 5, then V consists of three non-singular lines intersecting transversally 
at a single point, which is a fiber of Type IV. Further, the number c of 
components defined over k will be 3 if Y¥*+431YZ —a6 22? splits into linear 
factors over k and will be 1 otherwise. This completes our consideration of 
Steps 4 and 5 of Tate’s algorithm. 

We now assume that [73 |b). This means that the quadratic form 
in (**) has a double root in k. Making a translation Y —- Y + BZ moves 


the double root to Y = 0. We now have |z|a1, a2, 7\a3,a4, and 73|a¢|. 


The equation for V can be written as 


V:Y?Z4ma1.XY Z+n032Y Z* = 1X? +4+2091X°Z+7042X Z? +7106,32", 


and its special fiber Y*Z = 0 consists of the line Y = 0 with multiplicity 2 
and the line Z = 0 with multiplicity 1. 

The next step is to blow-up the double line 7 = Y = 0. To ease 
notation, we are going to dehomogenize at the same time, so we set 


X=m, Y=ry, Z=1 


and divide the equation for V by a. (The reason for the subscripts on 2 
and y2 is that they are related to our original Weierstrass coordinates by 
the formulas « = 72, and y = 7*yp. For the rest of this proof we will use 
the notation x = 7x, and y = 1y,.) 
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We now have the scheme 
Vo: TYS + TAL Liye + 743,242 = x} + 2127 + G4221 + 46,3, 


and the total blow-up consists of V and Vo glued together in the natural 
way. The special fiber of the total blow-up is thus formed by gluing together 
the two pieces 

Vere), 


< Byatt) 9 ~ 
Vo :0= Ui + 2121 + Ga2%1 + 46,3. 


There are now three cases to consider, depending on the number of distinct 
roots (in k) of the polynomial 


P(T) = T? + GaiT? + GgoT + G3. 


Proof of Step 6. For Step 6 we assume that P(T) has distinct roots 
in k. Then Vo consists of three distinct lines, so the blow-up is composed 
of the double line Y? = 0 together with four lines of multiplicity 1 in- 
tersecting it. This means we have a fiber of Type I}. Further, there is 
always one component Z = 0 of multiplicity 1 defined over k, and the 
other multiplicity-1 components correspond to the roots of P(T). Hence 
the number c of multiplicity-1 components is one more than the number of 
roots of P(T) in k. 


Proof of Step 7. For Step 7 we assume that P(T) has one simple root and 
one double root. Making a translation of the form 7; — x; + y, we may 
assume that the double root is T = 0, which implies that 2? + az, maa, 
and 7*]ag. The special fiber of Vo is now 


Vo i (21 + a21)r} = 0, 


so we need to blow-up Vo along the double line 7 = x; = 0. To do this, we 
make the substitution 21 = 7x2 and divide by 7 to obtain the scheme 


ate Did 2 
Vi > Yo + TQ11X2Y2 + a3,2Y2 = TN Ly + TA2,1L9 + TA4,3%2Q + a6,4- 


Our total special fiber is now composed of the following components: the 

simple lines Z = 0 and x; + G2; = 0, the double lines Y = 0 and 2; = 0, 

and the special fiber of V,. Notice how a fiber of Type I* is emerging. 
The special fiber of V; is 


Vi: ys + @3.2yo — G64 = 0. 


If this quadratic equation has distinct roots in k, then Vi consists of two 
distinct lines, and we have a fiber of Type Ij. Further, there are already 
two multiplicity-1 components defined over k, namely the lines Z = 0 
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and #1 +421 = 0, soc = 4 if the polynomial y3 + G3 2y2 — ae,4 has its roots 
in k, and c = 2 otherwise. 

If the polynomial ae + 43,2y2 — @e,4 has a double root, then making 
a translation on yg allows us to take the double root to be y2 = 0. This 
means that m*|a3 and m°|ag, and the special fiber of V; is yg = 0. We 
blow-up V; along this double line by making the substitution y2 = my3 and 
dividing by 7, which gives the scheme 


eo) 3 2 
Vo: TWy3 + 7a, 1 %2y3 + 143.343 = TLD + 42175 + a4,302 + O65. 
The special fiber of V2 is 
ena hea 
Vo: 42,123 + 04,3%2 + 465 = 0. 


If this quadratic equation has distinct roots in k, then V2 consists of two 
distinct lines, we have a fiber of Type I5, and we’re done. Otherwise the 
quadratic equation has a double root and V2 is a double line, so we translate 
to make the double line ae = 0, blow it up using rg = 7x3, and continue 
on our merry way. 

As explained during the description of the Step 7 subprocedure, this 
process will eventually terminate. The point is that the special fiber at 
each stage looks like 


~ Yo + A3.vYv — Gear = 9 ifn = 2v — 3 is odd, 
Vv : 2 = a“ < ‘ 
42105 + Gay412) + A6.241 =O ifn = 2v — 2 is even. 


So each two steps of the algorithm force a3, a4, and ag to be divisible by 
an additional power of 7. This implies the same for 64, bg, and bg, and 
hence also for A. But A is invariant under the various translations we are 
using, which shows that eventually we must get a quadratic polynomial 
with distinct roots. We will leave for the reader the easy verification that 
if the residue characteristic p 4 2, then the fiber V,, consists of two distinct 
lines precisely when n = v(A) — 6. This concludes our discussion of Step 7 
of Tate’s algorithm. 


Proof of Step 8. We now assume that the polynomial P(T) has a triple root 
in k, which after a translation we can take to be T =0. This means that 
[n?\a2, mag, and tlae|, so the special fiber Vg is the triple line x? = 0. 
Our total special fiber now consists of the simple line Z = 0, the double 
line Y? = 0, and the triple line z} = 0. The scheme Vg is regular except at 
the points on the special fiber satisfying 


2 
T= 21 = yo + 43,.2Y2 — G64 = 0. 


Making a translation y2 — y2 +7 allows us to assume that the polynomial 
ys + a3,2y2 —a6.4 has yo = 0 as a root in k. This may require making 
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a quadratic extension of k, in which case the components of the special 
fiber corresponding to the two roots of the quadratic polynomial will not 
be defined over k. 

We now have [75 |a6], and we blow-up Vo at the point 7 = 7] = yo = 0 
by making the change of variables 


rt rt ! 
T=TY, T1+=2Y, y=y. 


(This is chart 2 of the blow-up as described in (7.7).) This yields the 
scheme U’ C A}, = Spec R[n’, x’, y’] given by the equations 


3 2 tot 72 Hed 
UWoiat Tay 4x" + 3,97" =z! y' + Ta2,2x' +4432 7 +4657 , T=TY. 
The special fiber of U’ consists of three components, which we label as 
Fi: =a’=0, Fy:n=y =0, Fy: y' = 432 — 4432’ — dg 5n’ = 0. 


Notice that when we glue U’ to Vo, Fj is identified with the mul- 
tiplicity-3 component z? = 0 of Vo. Our next step is to compute the 
multiplicities of the new components F3 and F3. To do this, we rewrite the 
equation for WU’ as 


1S, rot 1207 ’ ! poy 
7 {y + a1 y +432 —-a2,2% y — G4 3X — ag 57 } =xry. 


The function x’ does not vanish identically on F%, so it is a unit in the local 
ring Of. Similarly, since we are making the Step 8 assumption that wt? { as, 
the quantity in braces is also a unit in O;,. It follows that both 7’ and y’ 
are uniformizers for O FS that is, they each vanish to order 1 on F%, so 


ord py (U’) = ord gy (1) = ordgy(n’y’) = 2. 


We leave for the reader the analogous verification that ord py (W) =1. 

But we’re not done with Step 8, because we have to perform an iden- 
tical blow-up of Vo at the singular point 7 = 21 = y2+a3,2 = 0. This gives 
another pair of components, one of multiplicity 2 and one of multiplicity 1. 
The resulting configuration is of Type IV*, which completes the verification 
of Step 8 of Tate’s algorithm. 


Proof of Step 9. For this step we have [ras], so the scheme U’ given 
above is singular at the point 7’ = xz’ = y’ = 0. We blow it up at that 
point by making the substitution 


nr’ 0 wa", q! seis x”, y! fe ye” 
This gives the scheme U” C A}, = Spec R[r”, x", y"] defined by the equa- 
tions 


ut woot Wt toot 2 
Wy” + aye yn" +.43.32"y" Tr 


N24 m2 now " 2 
=X YY +4220 yu +4437 +4657 , 
Gee vy nT". 
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Under our Step 9 assumption that 74 | a4, we find that UW” consists of the 
following four components: 


MoM a 
fy igci=a7 =0, Fy :y =7 =0, 
He : a" = y" = G4,3 = Gg.5m” = 0, 


Fi : y” = 443+ ag.5m” = 0. 
We compute the multiplicity of F// in the fiber by writing 


” ” wow wow ow 2 oo " nn 
T {y +a,10 Y +4330 YT — 42290 YyY — 44,3 — A657 pas y. 


The function in braces does not vanish identically on Fj’, so it is a unit 
in Op. This means that x” is a uniformizer for Fj’ and ordpy (1) = 


2ord gy (x) = 2. Hence 
ord pv (U") = ordyr (1) = ord py (a? y"n") =4, 


So we now have a chain of components of multiplicities 1, 2, 3, and 4. 
Notice how the fibers of Type ITI* and II* are emerging. 

A similar calculation shows that ord py (U’) = 2. Further, our Step 9 
assumption that «+ { aq implies that UW’ is regular at the point 7” = 
xz” = y” = 0 where Fy! and F%’ intersect. Hence there is a multiplicity-2 
component attached to the multiplicity-4 component of the regular minimal 
model. This means that the fiber is of Type III*, which completes our 
analysis of Step 9. For those who wish to recover the full Type III* fiber, 
we mention that U” is singular at the intersection of F and F3, that is, at 
the point 7” = a2” = y” —a43 =0. 


Proof of Step 10. We now assume that [74 [a4]. Then U” is singular at 
the point 7” = 2” = y” =0, so we blow it up using the substitution 


n" Woe W Wow uy a 
T= = : 


Tey; c=2 YY , y =yY 


This gives the scheme 


mw mt Wot oan mr m2 m2 
Ui taney Ww +4330 Yo 7 


m2 m2 2 an m2 yn yy72 m2 
=X -Y +42 90° YW +G440 yom +4657 , 


2 Amn 
raw! y 1A 


The special fiber U’” consists of four components, 


sth wa mw JT if armen 
Fooig’ =n" =0, Fy iy =7 =0, 


Fy’ : gill! = 1 Tas Gig. 7 = 0, Fy’ : yl = 1 _ ag.57” = 0. 
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aw 


We compute the multiplicity of F;” in the special fiber by writing 


m” mow mw an m2 172 
wf +a oy” + a3ge ya" — agan" y 

2 3 2 

~ag gc!" yl” ll deen! Aix glll y”". 


Both z’” and the quantity in braces are units in Op”, which shows that 7’ 
and y’” each vanish to order 1 on Fy’. This allows us to compute 


ord py (U") = ord py (m) = ord py (aly! an) = 5. 


Further, the Step 10 assumption that 7® { ag implies that U’” is regular 
at the intersection point 1” = a!” = y’” = 0 of Fi” and Fy”. Hence the 
appearance of the multiplicity-5 component F” tells us that the fiber is of 
Type II*. As usual, we leave for the reader the enthralling task of perform- 
ing the additional blow-ups necessary to find the other I[* components. 


Proof of Step 11. Finally, suppose that |7®lag}. Our cumulative assump- 
tions to this point are that ma, 77]a2, 7°|a3, m*\a4, and 7°|ag. In UW” we 
make the substitutions 


mn" 2 mw 2 3 me 2 
nm = 1/Ty3, gp = 23/TY3 y= Ty /Z3, 


which leads to the R-scheme 
¥3 + 1 122Y3 + 3,393 = 3 ts 2,223 + G4,402 + G66 


defined by a Weierstrass equation whose discriminant is 7~!*A. We can 
now begin again at Step 1 using this “smaller” Weierstrass equation. Note 
that each time we pass through Step 11, we will have shown that the original 
discriminant is divisible by an additional 71*. Therefore the algorithm will 
terminate. This concludes the proof of Tate’s algorithm (9.4). Oo 


Remark 9.6. During our verification of Tate’s algorithm (9.4), we left 
the case of multiplicative reduction (Type I,,) for the reader to analyze. 
There is another approach to multiplicative reduction using Tate’s p-adic 
analytic uniformization. We will describe Tate’s uniformization in the next 
chapter (V.3.1, V.5.3), but briefly, if & has split multiplicative reduction 
and K is a complete local field, then there is a g € K* with v(q) = v(A) > 0 
and an isomorphism of groups 


K*/q? — E(K). 


This isomorphism is given by v-adically convergent power series. Further, 
the isomorphism identifies the subgroups R* = Eo(K), so we get isomor- 
phisms 

E(K)/Eo(K) ~> K*/q2R* > Z/nZ. 
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Here the second map is induced by the valuation v : K* — Z, and 
n= v(q) = v(A). 


We know that E(i) = €(R), and it is clear from the definitions 
that Eo(K) = W®(R). Further, one can show that W°(R) = €°(R), ei- 
ther by a direct calculation or using the argument in (V §4). It follows 
that €(R)/E°(R) & Z/nZ, and then (9.2b) gives c =n. 

Similarly, if E has non-split reduction, then (V.5.4) says that we can 
find an unramified quadratic extension A’/K with residue field k’ such 
that FE has split reduction over K’. Then 


BUR") & K" /a®, 
E(K) = {we K"/q" s NE (u) € 7/07}, 
E)(K) = {ue R* : NE (uw) =1} 2 {uek™ : NE(u) =1}. 


Note the last isomorphism depends on the fact that K'/K is unramified. 
Finally, we have 


E(k) /E°(k) © €(R)/E°(R) = E(K)/Eo(K) = (NE')-1(q")/q2 (NE )71(1). 


The fact that K’/K is unramified means that the norm map is surjective on 
units, NK : R’/* + R*, from which one easily deduces that this last group 
is trivial if n is odd, and has order 2 if n is even. 


Proor (of Corollary 9.1). If we start with a minimal Weierstrass equation 
for E/K, then we never get to Step 11 of Tate’s algorithm, so the original 
equation defining W never changes. Tracing through the various stages 
of Tate’s algorithm, we see that the non-singular part W® of the Weier- 
strass equation ends up as an open subset of the minimal regular model €. 
Since W® clearly contains the image of the zero section, and since the spe- 
cial fiber W is irreducible, we see that W is the identity component of the 
special fiber of €C. Equivalently, W° = €°. (For an alternative proof of (9.1) 
which uses a bit more algebro-geometric machinery and does not rely on a 
case-by-case analysis, see Liu [1].) qo 


ProorF (of Corollary 9.2). (a) First, the equality E(k) = €(R) follows 
from the definition of the Néron model (5.1.3). Next we observe that the 
definitions of Eg and W® are both given in terms of the reduction of the 
given Weierstrass equation, so Ey(K) = W®°(R) is automatic. Now (9.1) 
says that W° = €°, so we get the middle equality Eo(K) = €°(R). Fi- 
nally, E\(4) and €'(R) each consists of the points which reduce to the 
identity on the special fiber, and these reductions are compatible since 
we already know that Eo(K) = €°(R). This proves the third equal- 
ity E,(K) = €1(R). 
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(b) The isomorphisms E(K)/Eo(K) © €(R)/E°(R) and E(K)/E\(K) = 
€(R)/E'(R) are immediate from the identifications proven in (a). Further, 
the reduction map €(R) — €(k) has kernel €'(R), and the inverse image 
of €°(k) is €°(R) by definition, which gives the injectivity of the right-hand 
maps. Finally, if R is complete (or merely Henselian), then the reduction 
map €(R) > €(k) is surjective (6.4a), so in this case the right-hand maps 
are isomorphisms. i 
(c) We have Ens(k) = W°(k) directly from the definitions. Now (9.1) 
implies that W°(k) = €9(k), which gives the desired result. 
(d) From (b) we have an injection E(K)/Eo(K) — &(k)/€%(k). The 
group €(k)/E°(k) is formed by looking at the special fiber @ of the minimal 
proper regular model and taking the components that have multiplicity 1 
and are defined over k. A quick perusal of the list of reduction types shows 
that only Type I,, has more than four multiplicity-1 components. Further, 
Tate’s algorithm (Step 2, see also (9.6)) says that if a fiber of Type I, has 
non-split reduction, then it has at most two components defined over k. 
This proves that E(K)/Eo(i) has order at most 4 unless E/K has split 
multiplicative reduction. Finally, if £/K has split multiplicative reduction, 
say with a Type I, fiber, then Step 2 of Tate’s algorithm says that n equals 
the valuation of the minimal discriminant, which is also equal to —v(j(£)). 
O 
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The conductor of an elliptic curve E/K is a quantity which measures the 
arithmetic complexity of £/K, similar in some ways to the minimal dis- 
criminant. Just like the discriminant, the conductor is a product over the 
primes p at which # has bad reduction, but the exponent of p is defined 
in terms of the representation of the inertia group on the torsion subgroup 
of &. The conductor is an important quantity which appears in the func- 
tional equation of the L-series of &, in the modular parametrization of 
elliptic curves over Q, and in various questions concerning the cohomology 
of E. 

Before defining the conductor, we briefly recall some standard facts 
about local fields. For more details, see Serre [4]. Let K be a local field of 
residue characteristic p, let L/K be a finite Galois extension with normal- 
ized valuation vy; and ring of integers R;, and let G(L/K) be the Galois 
group of L/K. Then for each integer i > —1, the i'-higher ramification 
group of L/K is the subgroup of G(L/K) defined by 


Gi(L/K) = {0 € G(L/K) : v1 (a? —a) >i +1 for alla € Ry}. 
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We write 
9(L/K) = #G,(L/K) 


for the order of the i**-ramification group. One should think of the higher 
ramification groups as measuring the extent to which the extension L/K 
is wildly ramified. The following lemma records some basic facts about the 
higher ramification groups. 


Lemma 10.1. Let L/K be a finite Galois extension of local fields. 

(a) The higher ramification groups G;(L/K) are normal subgroups of 
G(L/K). 

(b) G-a(L/K) = G(L/K). 

(c) Go(L/K) is the inertia group of L/K. 

(c) [Go(L/K) : Gi(L/K)] is relatively prime to p. 

(d) Gi(L/K) is a p-group. Thus L/K is wildly ramified if and only 
if Gy (L/K) #1. 


PROOF. See Serre [4], especially Chapter IV, Proposition 1 and Corollar- 
ies 1 and 3 to Proposition 7. Oo 


The conductor of an elliptic curve consists of two pieces, a tame part 
and a wild part. It turns out that if the residue characteristic p is at 
least 5, then the wild part will be zero. So if one is willing to ignore residue 
characteristics 2 and 3, then 6(£/K) can just be set equal to 0 in the 
following definition. 


Definition. Let E/K be an elliptic curve defined over a local field of 
residue characteristic p, and let I(K/K) be the absolute inertia group 
of Kk. Fix a prime @ different from p, let Ve(E) = T(E) @z, Qe be the & 
adic Tate module of E, and write V;(E)/‘*/*) for the subspace of Vp(E) 
that is fixed by I(K/K). The tame part of the conductor of E/K is the 
quantity 


e(E/K) = dimg, (Vé(E)/Vi(E)*/®) = 2 - dima, (Vi(E*/™). 


Next let L = K(E[é]). Then the wild part of the conductor of E/K is the 
quantity 


6(E/K) = Patan dimg, (24 Ee) 


The exponent of the conductor of E/K is the sum of the tame and wild 


parts, 
f(B/K) =e(E/K) + 6(E/K). 


The conductor is a representation-theoretic quantity, since it is defined 
in terms of the action of the Galois group G(K /K) on the torsion subgroup 
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of E. The following generalization of the criterion of Néron-Ogg-Shafare- 
vich [AEC, VII.7.1] provides a geometric interpretation for the tame part 
of the conductor. We note that in many books (including [AEC, C §16]) 
one finds this geometric description (10.2b) used as the “definition” of the 
conductor. 


Theorem 10.2. Let K be a local field of residue characteristic p, and 
let E/K be an elliptic curve. 
(a) The tame part of the conductor of E/K is given by 


0 if E has good reduction, 
e(E/K)= 4 1. if E has multiplicative reduction, 
2 if E has additive reduction. 


(b) If E/K has good or multiplicative reduction, or if p > 5, then 


0 if E has good reduction, 
6(E/K)=0 and f(E/K)= 41. if E has multiplicative reduction, 
2 if E has additive reduction. 


(c) In all cases, the exponent of the conductor f(E/K) is an integer which 
is independent of the choice of &. 


PROOF. (a) Notice that 


e(E/K) =0 = V,(E)*/®) = V, (EB) 
<=> 1(K/K) acts trivially on T(E) 
<=> T;(£) is unramified. 


So the assertion that 
e(E/K) =0 <> E/K has good reduction 


is precisely the criterion of Néron-Ogg-Shafarevich [AEC, VII.7.1]. We are 
going to mimic the proof of [AEC, VII.7.1] to obtain a somewhat stronger 
result. This proof is taken from Serre-Tate [1]. 

Let K™ be the maximal unramified extension of AK, and consider the 
two exact sequences 


0 — Ej(K™) — E(K™) —> E(K™)/Eo(K™) — 0, 


0 — E(k") — E(K™) — Eys(k) — 0. 


Here k, the residue field of K®", is the algebraic closure of the residue field 
of K. We note that E(AK™)/Eo(K™) is a finite group from (9.2d), and 
that £,(4™") has no &torsion from [AEC, VII.3.1], since it is isomorphic 
to the formal group of E. 
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For any abelian group A, we let T)(A) denote the Tate module of A, 


T,(A) = lim A[é"], 
and we set 


Vi(A) = Te(A) @z, Qe. 


We observe that V;(A) will be 0 if A has no ¢-torsion, or if A is a finite 
group. In particular, ° 


Ve (E(K™)/Eo(K™)) =0 and Ve(Ei\(K™)) = 0. 
Hence the two exact sequences given above yield isomorphisms 
Ve(E(K™)) 2 Ve(Eo(K™)) + Ve(Bns(R)). 
On the other hand, we clearly have 


Vi(E(K")) = Ve(E(R)) OR = vB), 


which proves the fundamental isomorphism 


I(K/K) 


Ve(E(K)) = Vi (Ens(K)). 


Now we compute 
e(E/K) = 2 —dimg, (Ve(E)KRHO) 


= 2-dimg, (ve (Ens(R))) 


dimg,(Ve(E)) if E has good reduction, 
= 2-4 dimg,(Ve(k*)) if E has multiplicative reduction, 
dime, (Ve(k*)) if E has additive reduction, 


where the last line follows from the standard description of the various 
reduction types [AEC, VII.5.1]. Using the fact that £4 p, we find that 


ViA(E)=Q?, Volk") Qe, = Ve(K*) = 0, 


which completes the proof of (10.2a). 
(b) Fix a prime ¢ 4 p, and let L = K(E[é]). If E/K has good reduction, 
then L/K is unramified from [AEC, VI1.4.1], so the inertia group Go(L/K) 
is trivial. It is then clear from the definition that 6(E/k) = 0. 

Next suppose that E/K has non-integral j-invariant, that is, vx (jz) < 
0. Let £4 p be a prime. We will prove later (see (V.5.3) and exercise 5.11) 
that there is an extension K’/K with the following properties: 
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(1) [K’: K] =1or 2. 

(2) K’/K is unramified (respectively ramified) if E/K has multiplica- 

tive (respectively additive) reduction. 

(3) There is a g € K’ such that K’(E[Z]) = K'(ue,q'/*). 

Here pe denotes the group of £**-roots of unity. 

Thus the extension K’(E[é¢])/K’ is composed of an unramified cyclo- 
tomic extension K’(pig)/K’ and a Kummer extension K'(ue, q'/*)/K’ (pe) 
whose order divides £. This shows that the extension K’(E[é])/K’ is 
at worst tamely ramified. It follows from properties (1) and (2) that 
K(E(4)/K is tamely ramified if either E/K has multiplicative reduction 
or if p > 3, which completes the proof of (10.2b) in the case that jr is 
non-integral. 

Finally we consider the case that E/K has integral j-invariant, or 
equivalently from [AEC, VII.5.5], E has potential good reduction. A key 
tool in proving (10.2) in this case is the following strengthening of the 
criterion of Néron-Ogg-Shafarevich. 


Proposition 10.3. Let K be a local field of residue characteristic p, and 
let E/K be an elliptic curve with integral j-invariant. 
(a) The following are equivalent. 
(i) E has good reduction over K. 
(ii) E[m] is unramified for every integer m > 1 relatively prime to p. 
(iii) E[m] is unramified for at least one integer m > 3 relatively prime 
to p. 

(b) Let m > 3 be an integer relatively prime to p. Then E has good 
reduction over K (E{m)). 


PROOF. (a) The equivalence of (i) and (ii) is [AEC, VII.7.1], and the im- 
plication (ii) = (iii) is trivial. So it suffices to prove that (iii) implies (i). 

We are given that E’/K has potential good reduction, so we can find 
a finite Galois extension L/K such that FE has good reduction over L. We 
are also given an integer m > 3 such that E[m] is unramified over K. Let £ 
be the largest prime dividing m, and let 


LS 0 5042 and 0 =4: if f=, 


Notice that é’|m since m > 3, so E[é’] c E[m]. Thus E[é’] is unramified 
over Kk. 

The fact that E has good reduction over L means that the inertia 
group I;7, acts trivially on the Tate module T(E) [AEC, VII.4.1b], so 
the inertia group Ip/% of L/K acts on T;(E). This action gives us a 
homomorphism 

Pp: Ihjk —_ Aut(Te(£)). 


Further, we are given that E[¢’] is unramified over K, so I; acts trivially 
on E/é’]. In other words, the image p(I;/x) is contained in the kernel of 
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the natural map 
Aut(Iz(E)) — Aut (E[¢’]). 


If we choose bases T;(E) © Z? and E[é’] © (Z/€’Z)?, then this last 
map becomes 
GLo(Ze) —> GLo(Z/¢’Z). 


It is an elementary exercise to verify that the kernel of this map, namely 
{M € GL2(Ze) : M =1(mod ae 


has no elements of finite order. (See exercise 4.38. This is the point at which 
we need ¢’ = 4 if € = 2, since if we took ¢’ = 2, then the matrix (~ -’) 


would be in the kernel.) 

We saw above that the image p(I;/x) is contained in this kernel. But 
the group Ix is finite, so it follows that its image p(Iz/%) is trivial. In 
other words, the inertia group I;/x% acts trivially on Ty(£), which proves 
that. T;(£) is unramified over K. Now [AEC, VII.7.1] tells us that E has 
good reduction over A’, which completes the proof that (iii) implies (i). 
(b) This follows immediately from (a), since E[m] is clearly unramified 
over the field K (E[m]). Oo 


We now resume the proof of Theorem 10.2(b), where, recall, we are 
assuming that & has integral j-invariant and that p > 5, and we are trying 
to verify that 6(£/K) = 0. Without loss of generality, we may replace K 
by its maximal unramified extension. 

For each integer m > 3 relatively prime to p, let Ly, = K (E[m]). Now 
Proposition 10.3(a) tells us that E has good reduction over L,,, and then 
another application of (10.3a) says that for any other m’, the set E[m’] is 
unramified over L,,, which means that the compositum LD, Lm is an un- 
ramified extension of L,,. But we took K = K™, so L,, has no unramified 
extensions, and hence Ly, C Lm. Reversing the role of m and m’ gives 
the opposite inclusion, which proves that all of the L,,,’s are the same. We 
write L for this common field. 

Now let @ > 3 be a prime with @ 4 p. (We will deal with the case € = 2 
later.) The action of the Galois group G(L¢/K) on E[é] gives an injection 


G(Le/K) — Aut(E[é]) ~ GLo(Z/EZ). 
It follows that 
#G(Le/K) | # GLe(Z/eZ). 


But we showed above that the field Lg = L is independent of £, so we find 
that #G(L/K) divides # GL2(Z/¢éZ) for all 2 4 2,p. The group GL2(Z/éZ) 
has order ¢(€ — 1)?(€+.1), and it is easy to see using Dirichlet’s theorem 
on primes in arithmetic progressions that 


ecd {0(€ —1)?(€+ 1)} = 48. 
&A2,p 
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Hence G(L/K) has order dividing 48. 

In particular, [L : K] is not divisible by p, since p > 5 by assumption. 
This proves that the extension L/K is at worst tamely ramified, so the 
higher ramification groups G;(L/K) are trivial for i > 1 (10.1d). It follows 
directly from the definition of 6(E/K) that 6(E/K) = 0. This completes 
the proof of (10.2b) for @ > 3. 

Finally, if € = 2, we use the fact that LD = L4 and that E has good 
reduction over L to conclude that 


G(L/K) — Aut(E[4]) = GL2(Z/4Z). 


This last group has order # GL2(Z/4Z) = 96 = 2°-3, so L/K is not wildly 
ramified at p, since p > 5. Hence 6(£/K) =0. 

(c) If p > 5, then (b) says that f(E/K) is an integer which depends only 
on the reduction type of E/K, hence is independent of £. The general case, 
which is due to Ogg [2], uses more machinery than we want to develop 
here. We have sketched the proof in exercise 4.46. For further details, see 
Ogg [2], Serre [7, chapter 19], and Serre-Tate [1, §3]. Oo 


If the residue characteristic of K is not equal to 2 or 3, then we 
have seen that 6(F/K) = 0, and so the exponent of the conductor sat- 
isfies f(E/K) < 2 from (10.2b). When the residue characteristic is 2 or 3, 
the exponent of the conductor is still bounded as described in the following 
result. 


Theorem 10.4. (Lockhart-Rosen-Silverman [1], Brumer-Kramer [1]) Let 
K/Q, be a local field with normalized valuation vx, and let E/K be an 
elliptic curve. Then the exponent of the conductor of E/K is bounded by 


f(E/K) < 2+ 3ux (3) + 6vx(2). 


(Here vx (p) is the ramification index of K/Q,.) Further, this bound is 
best possible in the sense that for every finite extension K/Q, there is an 
elliptic curve E/K whose conductor attains this bound. 


PROOF. We are going to prove the slightly weaker bound 
f(E/K) < 2+ 3ux(3) + 8vx (2), 


since the proof is easier and the weaker estimate suffices for most appli- 
cations. For an elementary, but involved, proof of the stronger inequality 
in certain cases, see Lockhart-Rosen-Silverman [1]. The proof for gen- 
eral K requires heavier machinery from representation theory; see Brumer- 
Kramer [1]. 

We begin with the observation that if p > 5, then 6(E/K) = 0 
from (10.2b) and e(£/K) < 2 directly from its definition, so f(E/K) < 2. 
It remains to deal with the cases p = 2 and p = 3. 
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Let & be any prime other than p, let LD = K(E[4), let Dr/K be the 
different of L/K, and let r be the smallest integer such that G,(L/K) = 1. 
We will need the following elementary properties of local fields: 


(i) vi(DrysK) = do (9:(L/K) -1), 

(ii) vL(Dr/K) < go(L/K) —1+ vz (go(L/K)), 
- vL(p) 

(iii) r< met 1 +1. 


See Serre [4, IV, §1, Prop. 4] for (i), Serre [4, III, § 7, remark following 
Prop. 13] for (ii), and Serre [4, IV, §2, exercise 3(c)] or Lockhart-Rosen- 
Silverman [1, Lemma 1.2(b)] for (iii). 

We are now ready to compute. 


f(E/K) = e(B/K) + 6(E/K) 


<24+6(£/K) since clearly «(E/K) < 2 
5. WG(L/K) ,, | 
=2+ . 0(L/K) dimr, (214/E14° a) 


since G;(L/K) =1 fori >r 


aa aR Ya L/K) since dims, (E[é) = 2 


2 
) CO 
= ERS (P+ La lotb/40) es 1)) 


he ( 
© go(L/K) 


v mare (2 +1+90(L/K)-1+ vx (go(Z/K)) 


r+vz(Or/k)) from property (i) 


from properties (ii) and (iii) 


eu +2+42vK(go(L/K)) since go(L/K) is the 


ramification index of L/K. 
Now suppose that p = 3, and take (say) = 5. Then 
G(L/K) — Aut(E[5]) = GL2(Z/5Z), 

so in particular go(L/K) divides # GL2(Z/5Z) = 480 = 2°-3-5. Hence 


vK (go(L/K)) < vx (480) = vx (3), 
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and substituting this in above with p = 3 gives the desired estimate, 
f(E/K) < 2+ 3x (3). 
Similarly, if p = 2, then we can take (say) ¢ = 3 and use the injection 
G(L/K) —> Aut(E[3]) & GL2(Z/3Z) 


to conclude that go(L/K) divides # GLo(Z/3Z) = 48 = 24-3. However, 
we can easily save a little bit. We are allowed to make an unramified 
extension of K, so we may adjoin to K a primitive cube root of unity. 
Then basic properties of the Weil pairing [AEC, III.8.1] imply that the 
image of G(L/K) lies in SL2(Z/3Z). (We sketch the proof below.) Hence 
the ramification index go(L/K) divides #SL2(Z/3Z) = 24 = 23-3. It 
follows that 
UK (go(L/K)) < UK (24) = vx (2°), 


and then substituting this in above with p = 2 yields 
f(E/K) < 2vx« (2) + 2+ 2ux (2°) = 2 + 8K (2). 


Let a be a uniformizer for kK. If p > 5, then (10.2b) says that any 
elliptic curve E/K with additive reduction will hit the maximum conductor 
exponent f(£/K) = 2. If p = 3, then we claim that the elliptic curve 


E:y=a=e4n 


satisfies f(E/K) = 2+3vx(3). Similarly, if p = 2, we claim that the elliptic 
curve 
E:y?+22y=2%—2? +02 


satisfies f(£/K) = 2+ 6vx(2). We could verify these claims by a lengthy 
direct calculation, but instead we will leave them for the reader to check 
(exercises 4.52 and 4.53) using Tate’s algorithm (9.4) and a formula of 
Ogg (11.1) to be proven in the next section. 

It remains to prove the assertion from above that the image of G(L/K) 
lies in SL2(Z/3Z). Fix a basis S,T € E[3]. Then e3(5,T) is a primitive 


cube root of unity, so it is in K. Let 0 € G(L/K), and let p(c) = G 2 € 


GL2(Z/3Z) be the matrix giving the action of o on E[3] relative to the 
chosen basis. Using [AEC, III.8.1(a,b,c,d)], we compute 
e3(S, T) => e3(S, T)° = e3($°,T7) S e3(aS + cT, bS + dT) 
= e3(S, apyee ve = €3(S, gp aect plo). 


Therefore det p(a) = 1, so the image of G(L/K) lies in SL2(Z/3Z). oO 


The conductor of an elliptic curve over a number field is defined by 
combining all of the local conductor exponents, just as the minimal dis- 
criminant was defined as a product of the local discriminants. 
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Definition. Let E/K be an elliptic curve defined over a number field K, 
and for each prime p of K, let f(E/K,) be the exponent of the conductor 
of EF consider as an elliptic curve over the local field Kp. The conductor 


of E/K is the ideal 
((E/K) = |] pf"). 
p 


Example 10.5. Let E/K be an elliptic curve defined over a number field, 
and suppose that EF has everywhere semi-stable reduction, by which we 
mean that EF has either good or multiplicative reduction at every prime. 
Then the conductor of E/K is the product of the primes of bad reduction, 


(E/K)= JJ pp, 
p|De/K 
where Dz; is the minimal discriminant of E/K [AEC, VII §8]. Con- 
versely, if the conductor f(£/K) is square-free, then E/K has everywhere 
semi-stable reduction. This follows from (10.2), which says that the con- 
ductor exponent satisfies f(E/K,) > 2 if and only if B/K, has additive 
reduction. 


Both the minimal discriminant and the conductor measure the extent 
to which an elliptic curve has bad reduction. We will see in the next 
section (11.2) that the exponent of the minimal discriminant is always 
greater than the exponent of the conductor, so we always have an inequality 
of the form 

NO (fey) < NG (Dex). 
Szpiro has conjectured that there should be an inequality in the other 
direction. 


Szpiro’s Conjecture 10.6. Fix a number field K and ane > 0. There 
is a constant c(K,e€) so that for every elliptic curve E/K, 


No (Deyx) < e(K,€)No (feyx)**- 


This conjecture, if true, lies very deep. Its validity would imply the 
solution to many other Diophantine problems, including for example the 
assertion that if a,b € Q* are fixed, if n > 2, and if m is sufficiently large, 
then the equation ax” + by™ = 1 has no non-trivial solutions z, y € Q. 

Surprisingly, it is quite easy to prove a function field analogue of 
Szpiro’s conjecture; see exercise 3.36. One can even prove such a result 
with ¢ = O and with an explicit constant c. The function field version 
of Szpiro’s conjecture was originally discovered by Kodaira (see Shioda [3, 
Prop. 2.8]) long before Szpiro formulated his conjecture, and it has been 
frequently rediscovered since that time; see for example Hindry-Silverman 
(2, Thm. 5.1] and Szpiro [1]. 
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811. Ogg’s Formula 


Let E'/K be an elliptic curve defined over a local field. Ogg’s formula relates 
the minimal discriminant of E/K, the exponent of the conductor of E/K, 
and the number of components on the special fiber of the minimal model 
of E over the ring of integers of A. This formula was originally proven 
by Ogg [2] in all cases except when K is a field of characteristic 0 with 
residue field of characteristic 2. Ogg’s proof relies on a lengthy case-by- 
case analysis. A more conceptual proof using scheme-theoretic techniques 
and working in all residue characteristics has been given by Saito [1], who 
proves Ogg’s formula as a special case of a general result for curves of 
arbitrary positive genus. An expanded exposition of Saito’s proof just in 
the case of elliptic curves can be found in Liu {1]. 


Ogg’s Formula 11.1. (Ogg [2], Saito [1]) Let K/Q, be a local field, 
let E/K be an elliptic curve, and let 


vx(Dg/K) = the valuation of the minimal discriminant of E/K, 
f(E/K) = the exponent of the conductor of E/K, 
m(E/K) = the number of components on the special fiber of E/K. 


Then 
v«(Deyx) = f(E/K) + m(E/K) - 1. 


Remark 11.1.1. The number m(F£/k) in (10.1) is the number of irre- 
ducible components defined over & on the special fiber of the minimal proper 
regular model of &/K. This includes all of the components, not just the 
multiplicity-1 components which make up the special fiber of the Néron 
model. Further, each component is counted once, regardless of its multi- 
plicity. For example, if E/K has Type I, reduction, then m(E/K) =n, and 
if E/K has Type III* reduction, then m(E/K) = 8. The value of m(E/K) 
for these and the other reduction types can be found in Table 4.1. 


Remark 11.1.2. The minimal discriminant and special fiber of E/K can 
be computed in a straightforward manner using Tate’s algorithm (9.4). For 
this reason, Ogg’s formula (11.1) is frequently used to compute the expo- 
nent of the conductor of E/K for residue characteristics 2 and 3. See for 
example exercises 4.52 and 4.53, as well as the conductor tables contained 
in Birch-Kuyk [1] and Cremona [1]. 


Proor (of Ogg’s Formula 11.1). If £/K has good reduction, then 
vK(Deyw) =90, f(E/K)=0, and m(E£/K) =1. 


These three equalities follow from [AEC, VII.5.1a], (10.2b), and (6.3) re- 
spectively. (Notice that (6.3) says that the minimal Weierstrass equation 
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for E/K is already the Néron model, so its special fiber is irreducible.) 
This verifies Ogg’s formula when F’/K has good reduction. 

Next suppose that E/K has multiplicative reduction. Then f(£/K) = 
1 from (10.2b). Further, Step 2 of Tate’s algorithm (9.4) tells us that the 
special fiber of the minimal model of E'/K is an m(E/K)-sided polygon, 
with m(E/K) = vx(De/xK). This verifies Ogg’s formula when E/K has 
multiplicative reduction. 

Finally, suppose that E'/K has additive reduction. Consider first the 
case that p > 5. Then (10.2b) tells us that f(E/K) = 2, so Ogg’s formula 
becomes 

UK (Deyx) = mM(E/K) +1. 


It is now a simple matter using Table 4.1 to verify Ogg’s formula case-by- 
case, checking each of the reduction types II, II,..., II*. 

It remains to consider p = 3 and p = 2 when E£/K has additive 
reduction. We will give a direct case-by-case verification for p = 3, since 
it only takes a few pages. A similar proof for p = 2 would be extremely 
lengthy, so for this last case we refer the reader to Saito’s proof [1] which 
works in all residue characteristics and does not rely on a case-by-case 
analysis. (See also Liu [1].) Unfortunately, the papers of Saito [1] and 
Liu [1] use techniques which are beyond the scope of this book. 

So we now assume that p = 3 and that E/K has additive reduc- 
tion. In particular, (9.2a) tells us that the tame part of the conductor 
is e(£/K) = 2. Further, the fact that p = 3 means that we can find a 
minimal Weierstrass equation for E/K of the form 


E:y? = 7° + apr? + asx + a6. 
The discriminant of this equation is 
A = -16(4a3ag — a3a3 + 4a} + 27a — 18aza446). 


Using this simplified form for F will make all of our calculations easier. 
Let L = K(£[2]) be the field generated by the 2-torsion points of EF, 
so L is the splitting field over K of the cubic polynomial 


f(x) = 2° + ax? + age + a6. 


Further, let M = K (JA). Notice that the discriminant of the polyno- 
mial f(x) satisfies A = —16Disc(f). Thus we see that K C MC L and 
a f[L: kK) =2 

2 i : = 2 or 6, 
IM K]= 43 if [ZL : K] = 1 or 3. 


Suppose first that E/K has Type III reduction, so m(E/K) = 2. 
A quick perusal of Step 4 of Tate’s algorithm (9.4) shows that E£ has a 
Weierstrass equation satisfying 


ve(a2)>1, vela4)=1, vK(ag)>2, and vK(A) =3. 
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We claim that L/K is at worst tamely ramified. To see this, we let Tx 
be a uniformizer for AK and use the fact that vx (A) = 3 to observe that 
™M = Te A is a uniformizer for M. Notice in particular that M/K is a 
ramified extension of degree 2. Now consider the polynomial 


g(x) = map f (mux) = 2° + ayy age? + Typ aan + Tp a6 € M{z]. 


The coefficients of g are in the ring of integers of M, and the discriminant 
of g has valuation 


v(Disc g) = v(aqp Disc f) = v(n%&/A?) = 0. 


Hence the splitting field of g over M, which is L, is unramified over M. 
This proves that L/K is not wildly ramified, so 6(E/K) =0. We have now 
computed all of the pieces in 


vx (Dex) — f(E/K) — m(B/K)+1=3-(2+0)-2+1=0, 


which completes the proof of Ogg’s formula for p = 3 and Type III reduc- 
tion. 

If E/K has Type IlI* reduction, then a similar calculation shows 
that vx (A) = 9 and that K(F[2]) is tamely ramified over K. So again we 
find that 


vk (Dex) — f(E/K) —m(E/K)+1=9- (2+0) -8+1=0. 


We leave the details to the reader (exercise 4.54a). 

For the remaining reduction types (II, III, IV, I*, [V*, UI*, II*), we 
are first going to show that if Ogg’s formula is true for F/M, then it is also 
true for E/K. More precisely, if we write Ogg(E/K) for the quantity 


Oge(£/K) = 0K (Dy) — f(B/K) — m(E/K) +1, 
then we will show that 
Ogs(E/M) = e(M/K) Ogs(E/K), 
where e(M/K) is the ramification index of M/K. It is clear that this equal- 
ity holds if M/K is unramified, since none of the quantities in Ogg(E/K) 
will change, so we only need to consider the case that M/K is ramified. 
Assuming now that M/K is ramified, we have e(M/K) = 2, so the 


ramification is tame. It follows that the higher ramification groups for L/K 
and L/M are the same, 


G;i(L/M) = G;(L/K) for alli > 1. 
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Further, M/K is a ramified extension of degree 2, so go(L/K) = 2go(L/M). 
Using these two facts and the definition of wild part of the conductor, we 
compute 


6(E/K) = , ota dime, (ER)/ER)o4/") 
_~ g(L/M) ,. , 
~ 24 Bgo(L/My (Ryze) 


1 
= 56(E/M). 


Notice that the full conductor does not satisfy such a simple relation, 
since e(E/K) = 2, and we will see that ¢(E/M) may be any of 0, 1, 
or 2. 

The next step in our proof that Ogg(L/M) = 2 Ogg(L/K) is to verify 
the following table describing how various quantities change when we make 
the ramified quadratic extension M/k. (For more extensive tables, see 
exercises 4.48, 4.49, and 4.50.) 


Type(E/K) 1 av iP ae le si Iv* | Ir 
(n 2 1) 
Type(E/M) IV | IV* | Io | Ion Iv | Iv* | 
2uK(De/K) — um(De (Mt) 0 0 12 : 12 12 12 
2f(E/K) — f(E/M) 2 2 4 3 2 2 
2m(E/K) — m(E/M) -1}]-1{9 [10 Let | ad 


Notice that the value of 2m(E/K) — m(E/M) in the last line is easy to 
compute by using the first two lines and reading off the number of com- 
ponents for each reduction type from Table 4.1. Similarly, the identity 
6(E/K) = 6(E/M)/2 that we proved above implies that 


2f(E/K) ~ f(E/M) = 2e(B/K) ~ e(E/M), 


so the penultimate line of the table follows immediately from the first two 
lines and the fact (9.2a) that « = 0 for good reduction, ¢ = 1 for multi- 
plicative reduction, and « = 2 for additive reduction. The verification of 
the remainder of the table is now simply a matter of tracing the various 
reduction types through Tate’s algorithm. We will do Type IV* reduction 
to illustrate the idea, and leave the other cases for the reader. 

So suppose that E/K has Type [V* reduction. Then Step 8 of Tate’s 
algorithm gives a minimal Weierstrass for E'/K satisfying 


vK (a2) > 2, vK (a4) > 3, UK (ag) = 4. 
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(Remember we can assume that a; = a3 = 0.) Fix a uniformizer my, 
for M and make the change of variables x = 73,2’, y = 7},y’. This gives 
a Weierstrass equation for £/M of the form 


2 3 2 
E:y’ =2" +a5zr" +a4z' +46, 


a a —6 ‘ 
where ay = Trp 2, a, = Ty G4; and ag = 7, a6. In particular, we have 


vm (ay) = 2uK (a2) -2>2, vas(ay) = 2vK (a4) — 4 > 2, 
um (a6) = 2uK(a6) —-6 = 2, vm (A’) = 2uK (A) — 12. 


Now a quick check of Step 5 of Tate’s algorithm shows that this is a minimal 
equation for F/M and that E/M has Type IV reduction. Further, A’ = 
Ty A, 80 


vm (Deym) = vM(4‘) = 2uK(A) — 12 = 2un(Dzyx) — 12. 


Finally, a fiber of Type IV* has seven components, and a fiber of Type IV 
has three components, so 2m(E/K) — m(E/M) = 11. This completes the 
verification of the Type IV* column in the above table. The other columns 
may be verified similarly. 

We now use this table to compute 


2 Oge(E/K) — Oge(E/M) 
= {2uxK (Dey) —um(Dejm)} — {2f(E/K) — f(E/M)} 

— {2m(E/K) — m(E/M)} + {2-1} 
0-—2-(-1)+1=0 if Type(£/K)=II or IV, 
12-4-—-9+1=0 if Type(£/K)=I5, 
12-3-10+1=0 if Type(E/K)=I*¥, n> 1, 
12-2-11+1=0 _ if Type(E/K)=IV* or II. 


This completes the proof that Ogg(E/M) = 2 Ogg(E/K), so it now suffices 
to prove Ogg’s formula for F/M. We note from the table that E/M is of 
Type IV, IV*, Io, or In, so it suffices to consider these four cases. 

If E/M has Type Io reduction, which is to say E/M has good reduc- 
tion, or if F/M has Type I, reduction with n > 1, which means F/M has 
multiplicative reduction, then we are done, since we have already verified 
Ogg’s formula for good and multiplicative reduction. 

Suppose now that F/M has Type IV reduction. Using Step 5 of Tate’s 
algorithm, we see that our Weierstrass equation satisfies 


um (a2) > 1, um (a4) > 2, um (ae) = 2. 


Let a € L be a root of the polynomial f(x) = 2° + agxz? + asx + ag. Note 
that the degree of L/M is either 1 or 3, and L is the splitting field of f, so 
we have L = M(a). We want to use the fact that a satisfies the equation 


f(a) = a? + aga? + aya + ag = 0 
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to compute its valuation. First we observe that 


3uqr(@) = vas (aza? + aga t ag) > min{vjs(a2a”), vm (a4Q), vm (a6) } 


> min{1 + 2var(a), 2+ vm (a), 2}, 
so vys(a@) > 2/3. Similarly, 


2 =v (a6) = vm (a? + aga? + asa) 
> min{vas(a"), var (a2a7), var (asa) 
> min {3vqs(a), 1+ 2va¢(a),24+ vu (a)}, 
which gives the opposite inequality 2/3 > vas(a@). Hence vas(a) = 2/3. In 


particular, this proves that L/M is totally ramified of degree 3, so L/M is 
wildly ramified. Further, if we choose a uniformizer 7,, for M, then 


def 
ap = tu/a€eL 


will be a uniformizer for L, since var(7z) = 1/3. 

We can use 7; to determine the higher ramification groups for L/M. 
The Galois group G(L/M) is a cyclic group of order 3, so there is an 
integer r > 1 such that 


G(L/M) = Gi(L/M) =--- = G,-1(L/M), 
and 
G,(L/M) = Gr4i(L/M) =--- = 1. 


We want to compute this integer r. Writing G(L/M) = {1,0,07}, the 
definition of the higher ramification groups says that 
r=v,(t% — Tr). 
We substitute 7, = m,¢/a and use the fact that m,, € M to get 


T 7 
r= UL ( ae =u) = ut(t™m) + vt(a — a7) — vp(aa”). 


The extension L/M is totally ramified, so vz = 3uyz. This means that 
ut(mm) = 3 and vyz(a) = vz (a) = 2, and hence 


r=vuz(a—a’?)—1. 


Further, the exact same calculation gives r = vz (a@ — a?) — 1, and since 
2 

the valuation is Galois invariant, we also find that r = vz(a’ — a” )—-1. 

Adding these three expressions for r yields 


3r = v~p(a— a’) + uz (a - a’) +uz(a’ — a”) —3 


Sip ((a ~ a7 \(a — a” )(a? — a”’)) = 
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Notice that a, w%, and a? are the three roots of f(x), so the discrim- 
inant of f(a) is 


Disc(f) = — ((a ~a7)\(a — a” \(a? — a”) 


We observed above that A = —16 Disc(f), which gives us the formula 
1 . 1 1 
aera (Dise(f)) —1= gtE(A) -l= gu (A) —1. 


This relation is the key to proving Ogg’s formula, since it relates the conduc- 
tor, via the higher ramification groups, to the discriminant of the minimal 
Weierstrass equation. 

The non-trivial elements of G(L/M) act on the non-zero elements 
of E[2] via a permutation of order 3, so the only element of E[2] fixed 
by G(L/M) is O. This means that E[2]¢*4/™) equals E[2] for i > r and 
is trivial for i < r, and hence 


. | 2 ifti<r, 
dime, (B21/E12)%"") = . ifi>r. 


Using this and the value for r computed above, we can determine the wild 
part of the conductor directly from the definition: 


6(B/M) = = an dime, (B(2|/B[2|4/"?) 


o(L/M) 
ee oe —1) 
= um (A) - 


=vum(Dejm) -—4 


Adding this to the tame part <¢(£/M) = 2 of the conductor gives the 
relation 


f(E/M) = &(E/M) + 6(E/M) = um(Deym) — 2. 


It only remains to recall that we are working with a curve F/M having 
Type IV reduction. This means that m(E/M) = 3, so the last relation can 
be rewritten as 


vm(Deyjm) = f(E/M) +2 = f(E/M)+3-1= f(E/M)+m(E/M) -1. 
This completes the proof of Ogg’s formula when E'/M has Type IV reduc- 


tion. The proof for Type IV* reduction is similar, so we leave it for the 
reader. Oo 
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Corollary 11.2. Let K/Q, be a local field, let E/K be an elliptic curve, 
let vx(Dg/K) be the valuation of the minimal discriminant of E/K, and 
let f(E/K) be the exponent of the conductor of E/K. Then 


F(E/K) < vx (De/x), 


with equality if and only if E/K has reduction type Ip, i, or II. 

Proor. When p > 5, the stated inequality is quite easy to verify by a 
direct calculation, since it essentially comes down to showing that if the 
discriminant satisfies ux (Dg/x) = 1, then E/K has multiplicative reduc- 
tion. We proved this fact earlier (9.5a). In general, Ogg’s formula (11.1) 
tells us that 


vK(Dzyyx) — f(E/K) =m(E/K) —12 0, 


since the number of components certainly satisfies m(E'/K) > 1. Further, 
there is equality if and only if m(£/K) = 1, which occurs exactly for 
reduction types Ip, I,, and II. oO 


EXERCISES 


4.1. (a) The special orthogonal group SO, is defined to be 
SO, = {M ESL, : ‘MM =I}, 


where 'M denotes the transpose of the matrix M. Prove that SO, is an 
affine group variety. 
(b) The orthogonal group On is defined to be 


On ={M€GL, : ‘MM = I}. 


Prove that there is an isomorphism O, = SO, xZ/2Z of group varieties. 


4.2. Let A be the matrix A = es 1 ), where J, is the n x n identity matrix. 


The symplectic group Sp.,, is defined to be 
SPan — {M © SLan : "MAM = A}. 


Prove that Sp,,, is an affine group variety. 


4.3. Let E be an elliptic curve, and let d : E — E be a morphism satisfy- 
ing ¢(O) = O. Use the Rigidity Lemma 1.8 to prove that ¢ is a group 
homomorphism. (This provides an alternative proof of [AEC, III.4.8] not 
requiring the theory of Picard groups. This proof readily generalizes to 
abelian varieties of arbitrary dimension.) 
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4.4. 


4.5. 


4.6. 


4.7. 


4.8. 


4.9. 


4.10. 


Let E be an elliptic curve, and let wp: E x FE — E be a morphism satisfying 
p(P,O) = p(O,P) = P for all Pe E. 

Prove that u(P,Q) = P+ Q for all P,Q € E. 

(a) Let A be a regular local ring, and let 8 be a prime ideal of A. Prove 

that the localization of A at $B is a regular local ring. 

(b) Let X be a scheme, let x,y € X be points, and suppose that z is in 

the closure of y. (N.B. Points of a scheme need not be closed.) If x is a 

regular point of X, prove that y is also a regular point. Hence X is regular 

if and only if all of its closed points are regular. 

(c) With notation as in (b), give an example to show that it is possible to 

have x singular and y non-singular. 

Let R be a discrete valuation ring with fraction field K, residue field k, 

and maximal ideal p, and let 7 be a uniformizer for R. Let X/R be the R- 

scheme defined by X = Spec R[t]/(zt). 

(a) Prove that the generic and special fibers of X are given by X,, ~ Spec K 

and Xp & Aj. Note that X, is smooth over K and that Xy is smooth 

over k. 

(b) Prove that X is not smooth over R. This shows that something like 

the irreducibility condition in (2.9) is necessary. 

*Let @ : X — S be a morphism of finite type of Noetherian schemes, 

let x € X, and let s = ¢(z). Prove that ¢ is smooth at z if and only if ¢ is 

flat at x and the fiber X; is smooth over the residue field of S at s. (This 

shows that what is really going wrong in the previous exercise is the fact 

that X is not flat over R, since its fibers have different dimensions.) 

Complete the proof that G(T) is a group by verifying the following two 

facts, where we use the notation from (3.2). 

(a) (60) *¢@=0007r for all 6 € G(T). 

(b) x (wx A) = (d * W) *A for all ¢, 0, € G(T). 

Let S be a scheme, and let Ga and Gm be the additive and multiplicative 

group schemes respectively (3.1.2, 3.1.3). 

(a) Prove that Ga(S) =T(S, Qs). 

(b) Prove that Ga(S) = [(S, 0%). 

Here Os is the structure sheaf on S, and I'(S, F) denotes the global sections 

of the sheaf J. 

(a) For each integer r > 1, let fs, be the scheme 


br = Spec Z[T]/(T" — 1). 


Prove that there is a natural inclusion uy — Gm so that p, is a (closed) 
subgroup scheme of Gm. The group scheme yp, is called the scheme of 
r*_roots of unity. 

(b) Let R be a ring of characteristic p > 0, and for each integer r > 1 
let apr be the R-scheme 


Qpr = Spec R{T|/(T? ). 
Prove that there is a natural inclusion apr — Ga;p so that apr is a (closed) 
subgroup scheme of Ga/p. 


(c) Let R be asin (b). Prove that ap and j1,7/pR are isomorphic as schemes 
over R, but that they are not isomorphic as group schemes over R. 
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4.11. 


4.12. 


4.13. 


4.14. 
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Let G be a group scheme over S. 
(a) Prove that for each integer m, the multiplication-by-m map [m] : G > 
G described in (3.4) is a morphism. 
(b) Prove that [—m] = [—1] 0 [m]. 
(c) More generally, prove that [mn] = [m] o [n]. 
(d) Prove that [m+n] = po ([m] x [n]). 
(e) With notation as in (3.4), prove that (0] = oo o 7 and that [—1] =7. 
Let R be a ring, let d € R, and let Ga C A?, be the affine scheme given by 
the equation 

Ga: 2? —dy’? =1. 


(a) Prove that the composition law 


Ga XrGa — Ga, 
((x1, y1); (@2, y2)) > (a1a2 + dyi ye, t1y2 + 241) 
gives Gg the structure of a group scheme over R. 


(b) Prove that Go fits into the following exact sequence of group schemes 
over R: 


0 > Gasr >» Go > H2/R >]. 


(Here p12;p = Spec R{T]/(T? — 1) is the scheme of square roots of unity; 
see exercise 4.10(a).) 

(c) Prove that Go is isomorphic to Ga/p X RH2/R as group schemes over R. 
(d) Let di, d2 € R. Prove that Ga, is isomorphic to Ga, as group schemes 
over R if and only if there is a unit u € R* such that d; = u7do. 

(e) If 2 € R*, prove that G; and Gm;g are isomorphic as group schemes 
over R. 


Let K be a field of characteristic 0, and for each d € K, let Ga/K Cc AX 
be the group variety 


Gaia’ —dy’ =1 


described in the previous exercise. Prove that every connected group va- 
riety of dimension one defined over K is isomorphic over K to one of the 
other following group varieties: 

(i) The additive group Ga; x. 

(ii) The group Ga for some d € K*. (Note that G; is isomorphic to the 
multiplicative group Gm x.) 

(iii) An elliptic curve defined over K. 


Let © C P? be the arithmetic surface given by the equation 
C:y? =a + 2a? +6. 


Complete the proof (4.2.2) that © is a regular scheme by verifying that C 
is regular at the points x = y=3=0 and 2+66= y=97=0. 
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4.15. 


4.16. 


4.17. 


4.18. 


4.19. 


4.20. 


4.21. 


Let R be a discrete valuation ring with normalized valuation v, and let W C 
P2, be the arithmetic surface given by the Weierstrass equation 
W: y + airy t+ az3y = z+ agu + asx + ac. 
Let A and j be the associated discriminant and j-invariant. 
(a) If v(A) = 1, prove that W is a regular scheme. 
(b) If v(A) = 2 and v(j) > 0, prove that W is a regular scheme. 
(The computations are simpler if you assume that 2 and 3 are units in R, 
but the results are true in general.) 
Let @/Z be the arithmetic surface in A? defined by the equation 
C: (a? + 4a? + 8a — 1)y? — (224 4+ 2? — 2? — 22)y? 
— (a® — 32° + 324 — o*)y + Qe" 4+ 2% — ® — 20* = 7. 
Describe the special fiber C7 of C over the point (7) € SpecZ; that is, 
describe the components of C7, their multiplicities, and their intersection 


points. Draw a sketch (in R?) illustrating C7. (See (4.2.4) and Figure 4.2 
for a similar calculation.) 


Let 7 : © — Spec(&) be a regular arithmetic surface over a Dedekind 
domain R, let p € Spec(R), and let x € Cy C © be a non-singular closed 
point on the fiber of C over p. Complete the proof of Proposition 4.3 by 
proving that 7*(p) ¢ M2.» 

This exercise generalizes (4.4). Let R be a Dedekind domain with fraction 
field kK, and let X be a “nice” scheme over R whose generic fiber X/K is 
a smooth, projective variety. (Here “nice” has the same meaning as in the 
definition of arithmetic surface; see §4.) 

(a) If X is proper over R, prove that X(K) = X(R). 

(b) Suppose that X is a regular scheme, and let X° C X be the largest 
subscheme of X with the property that X° is smooth over R. Prove 
that X(R) = X°(R). 

Let R’/R be an extension of discrete valuation rings with maximal ide- 
als p, p’, fraction fields K, K’, and residue fields k, k’ respectively. Sup- 
pose that K’/K is a finite extension. Prove that R’ is the localization of a 
smooth R-scheme if and only if p’ = pR’ (i.e., R’/R is unramified). 

Let R be a discrete valuation ring with fraction field K, and let E/K be 
an elliptic curve given by a Weierstrass equation 


E:y +a:rzy = 2° +46 with a; € R* and ag € R. 


Let W Cc P% be the R-scheme defined by this Weierstrass equation. 
(a) Prove that W is a regular scheme if and only if v(ag) < 1. 
(b) Prove that W is smooth over R if and only if v(ag) = 0. 


Let E/K, W/R and W°/R be as in the statement of Theorem 5.3, where 
we assume that we start with a minimal Weierstrass equation for E/K. 
(a) If E/K has split multiplicative reduction, prove that the special fibers 
of W®° and Ga /R are isomorphic as group schemes over the residue field 
of R. 

(b) If £/K has additive reduction, prove that the special fibers of W° 
and Ga; are isomorphic as group schemes over the residue field of R. 

(c) Give a similar description of the special fiber of W° in the case that 
the curve E/K has non-split multiplicative reduction. 
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4.22. 


4.23. 


4.24. 
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Let R be a discrete valuation ring with fraction field K, let E/K be an 


elliptic curve, and choose a Weierstrass equation for E/K with coefficients 
in R, 


B:Y?Z+aiXYZ + a3YZ? = X? 4.a2X°Z + a4XZ? + 062°. 
Let W C P} be the R-scheme defined by this Weierstrass equation, and let 
pw: WxW---+W 


be the rational map induced by the addition law on the generic fiber. Define 
affine subsets of W by 


War ={Z #0} and = Wag = {¥ £0}. 


Prove that y is a morphism when restricted to each of the following sets: 
(a) Wat XR Whe. 

(b) We XR War. 

(c) War Xr War. 

(The formulas will be easier if you assume that 2 and 3 are units in R and 
take a Weierstrass equation of the form Y?Z = X* + AXZ? + BZ >. We 
described the behavior of 4 on Wag XR Wag during the proof of (5.3). This 
exercise is asking you to complete the proof of (5.3).) 


Let R be a Dedekind domain with fraction field K, let E/K be an elliptic 
curve, let C/R be a minimal proper regular model for E/K, and let E/R be 
the largest subscheme of C/R which is smooth over R. Note that E€/R is a 
Néron model for E'/K, so in particular E is a group scheme over R (6.1). 
(a) Let P € E(K) & E(R), and let tp : E — E be translation-by-P (3.3). 
Prove that tp extends to an R-morphism C — C. 

(b) Prove that every automorphism a: E/K — E/K extends to an R- 
morphism © — €@. 

(c) *Prove that the group law € xr E — E extends to give a group scheme 
action E€ xr eC C. 

(d) Prove that in general the group law € xr E — E does not extend to 
give an R-morphism C xr CC. 


Let R be a discrete valuation ring with fraction field K, residue field k, and 
residue characteristic p. Let E/K and E’/K be elliptic curves, and let €/R 
and €'/R be Néron models for E/K and E'/K respectively. Let 6x : E > 
E’ be an isogeny of degree m > 1 defined over K. Assume that either p 
does not divide m, or else that E/K does not have additive reduction. 

(a) *Prove that dx extends to an R-morphism ¢r : E > €’. 

(b) Prove that ¢r is a homomorphism of R-group schemes. 

(c) Prove that the restriction of ¢r to the special fiber is a finite mor- 
phism ¢y : &/k = é'/k which maps the identity component of € to the 
identity component of €’. 

(d) Prove that there is an R-morphism @z : €’ > € with the property that 
the composition droge : € > € is the multiplication-by-m map on € (3.4). 
This generalizes the construction of the dual isogeny for elliptic curves over 
fields [AEC, III §6]. 
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4.25. 


4.26. 


4.27. 


4.28. 


Let R be a Dedekind domain with fraction field K, let E/K be an elliptic 
curve, and let €/R be a Néron model for E/K. Prove that the connected 
component of €/R as described in (6.1.2) is a subgroup scheme of €/R. 


Let R be a Henselian discrete valuation ring with valuation v. Let f(x) € 
R{z] be a monic polynomial, and let a € R be an element with the property 
that 

u(f(a)) > 2v(f'(a)). 


Prove that there is a unique element a € R satisfying 
v(a—a) > v(f'(a)) and f(a) =0. 


(Note the strict inequalities. ) 


Let R be a discrete valuation ring with maximal ideal p and residue field k, 
let fi,..-,fm € Rlxi,...,2n], let 


X = Spec R[x1,..-,2n]/(fi,---, fm) 
be the scheme defined by the fi’s, and let 


J = J(£1,...,2n) = (Of: /OL5)1<i<mi<j<n 


be the associated Jacobian matrix. 

(a) Prove that the generic fiber of X is empty if and only if some power 
of p is contained in the ideal (fi,..., fm). 

(b) Let @ = (G1,...,Gn) € X(k) be a point on the special fiber of X. Prove 
that Xp, is smooth over & at a if and only if the matrix J(a) satisfies 


rank J(@) = n — dimX,. 


(c) Assume that X is irreducible, reduced, and has non-empty generic fiber. 
Suppose further that the ring R is Henselian, and let a = (a@1,...,@n) € 
A"(R) be a point satisfying 


fila) =--: = fm(a) = 0(mod p) and rank J(a) = n—dimX,. 
Prove that there exists a (unique) point a € A”(R) such that 
fila) =--: = fm(a) =0 and a@ =a(mod p). 


This result is a multi-variable version of Hensel’s lemma. It implies the 
surjectivity of the reduction map for smooth schemes over Henselian rings. 
(Hint. First prove (c) under the assumption that R is complete.) 


Let R be a discrete valuation ring with residue field k, and let R® and R* 
be the Henselization and strict Henselization of R respectively (6.5). 

(a) Let R’ be a Henselian discrete valuation ring, and let i: R — R’ 
be a local homomorphism. Prove that there is a unique local homomor- 
phism R’ — R’ extending i. 

(b) Let R” be a strictly Henselian discrete valuation ring with residue 
field k”, let i: R — R” be a local homomorphism, and let u: k” > k 
be a k-homomorphism. Prove that there exists a unique local homomor- 
phism R** — R” which extends i and which induces the map u on the 
residue fields. 
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4.29. 


4.30. 


4.31. 


4.32. 


IV. The Néron Model 


*Let K be an algebraically closed field, let C/K be a curve of genus g > 1, 
and let V be the g-fold symmetric product of C. Note that the points of V 
can be naturally identified with the positive divisors of degree g on C’. 

(a) Fix a basepoint Po € C. Prove that there is a rational map wp: V xV > 
V determined by the property 


u(z,y) ~2+y— g(Po). 


(Here ~ denotes linear equivalence of divisors.) 

(b) Prove that w is a normal law, and hence from (6.9) that there is a 
group variety J/K associated to py. 

(c) Prove that the map V — J is a morphism, and deduce that J is proper 
over K. 

(d) Prove that the map of sets 


V — Pic®(C), x +— class[z — g(Po)], 


induces an isomorphism of groups J(K) — Pic®(C). 

The group variety J is an abelian variety called the Jacobian variety 
of C. This construction of the Jacobian variety is due to Weil [3]. For a 
further discussion, see the proof sketch of Proposition III.2.6(b). 


Let R be a strictly Henselian discrete valuation ring with fraction field Kk. 
Let €/R be a group scheme over R whose generic fiber E/K is an ellip- 
tic curve. Prove that E/R is a Néron model for E/K if and only if the 
inclusion €(R) > E(K) is a bijection. 

Let R be a discrete valuation ring with uniformizing element 7 and alge- 
braically closed residue field k. Assume that char(k) 4 2,3,5. Let C C A? 
be the affine scheme defined by 


C:y=atn’: 


that is, C = Spec Riz, y|/(y? — 2° — 2”). 

(a) Show that © is regular except at the one point 7 = x = y = 0 on the 
special fiber. 

(b) Compute the blow-up of © at the singular point 7 = x = y = 0 as 
explained in (7.7). Show that the resulting scheme is still not regular. 

(c) Continue blowing up until you get a regular scheme. Draw a picture 
of the special fiber similar to the diagrams in Figures 4.3 and 4.4. 

(d) Repeat (a), (b), and (c) for the arithmetic surface y? = 2° +77. 


(a) For each of the Kodaira-Néron reduction types (8.2), compute the 
intersection incidence matrix of the special fiber. 

(b) Show that each of the incidence matrices in (a) has determinant 0. 
(c) Let M be the matrix obtained by taking any one of the incidence matri- 
ces in (a) and deleting a row and column corresponding to a multiplicity-1 
component. Show that det(M) is equal to plus or minus the number of 
multiplicity-1 components on the special fiber. Equivalently, |det(M)| is 
the order of the group of components of the Néron model. 
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4.33. 


4.34. 


4.35. 


4.36. 


4.37. 


4.38. 


Let R be a discrete valuation ring, and let C/R be a proper regular model 
of a curve of positive genus. Generalize part (b) of the previous exercise 
by showing that the incidence matrix of its special fiber has determinant 0. 
(For a generalization of part (c), see Raynaud [1].) 


Let R be a discrete valuation ring with maximal ideal p, fraction field K, 
and algebraically closed residue field k. Let C/K be a non-singular projec- 
tive curve of genus g > 1, and let C/R be a minimal proper regular model 
for C/K. Suppose that the special fiber C, of C contains a configuration 
with ¢ components of the form shown in Figure 4.6(b), where each of the 
illustrated components satisfies pa(F;) = 0 and F? = —2. 

(a) Prove that t < 4g +2. 

(b) If t = 4g + 2, complete the picture of Cy. In particular, show that Cp 
has exactly 4g +5 components. (For g = 1, you’ll get a fiber of Type II*.) 


Let R be a complete discrete valuation ring with fraction field K and al- 
gebraically closed residue field k of characteristic p # 2,3. Let E/K be 
an elliptic curve with additive reduction, and let Eo(K) be the subgroup 
of E(K) consisting of points with non-singular reduction (9.2). 

(a) Prove that Eo(K) is uniquely divisible by 2 and 3. 

(b) Prove that E(K)/Eo(K) is killed by 12. 

(c) Prove that the natural map E(K)[12] —~ E(K)/Eo(K) is an isomor- 
phism. 


(a) Let Ca, Cs € Z be integers with C? — C§ 4 0. Prove that there exists 
a Weierstrass equation 


y tarry + agy = 2° + age” + age + a6 


with coefficients a1, a2,a3,@4,a6 € Z satisfying ca = Cs and ce = Ce if and 
only if one of the following two conditions is true: 

(i) ord3(Cs) # 2 and Cg = —1 (mod 4). 

(ii) ord3(Ce6) A 2, ord2(C4) > 4, and Ce = 0 or 8 (mod 32). 


(b) Use the criteria in (a) to devise a quick algorithm to check whether a 
given Weierstrass equation with integer coefficients is a minimal Weierstrass 
equation. 

(c) Generalize the criteria in (a) to an arbitrary field K/Q. 

Let R be a discrete valuation ring with fraction field AK, let E/K be an 
elliptic curve given by a Weierstrass equation 


y + acy + a3y = 2° + ax + age + a6 


with coefficients in R, and assume that mla3,@4, 7?|a6, and x { bz. Resolve 
the singularity on the special fiber by a sequence of blow-ups and show that 
the special fiber is of Type I, with n = v(A). 
Let @ be a prime, let ’ = @ if € 4 2, and let & =4 if €= 2. Prove that the 
group 

{M € GL2(Ze) : M =1(mod £’)} 


contains no elements of finite order. 
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4.39. 


4.40. 


4.41. 


4.42. 


4.43. 


4.44. 


IV. The Néron Model 


This exercise generalizes the previous exercise. Let K/Q, be a p-adic field 
with ring of integers R, maximal ideal p, and normalized valuation vx. 
(a) Suppose that there is a matrix M € GLn(R) of exact order m > 2 
satisfying 

M=1 (mod p’). 


Prove that 


m=p and rs Re) 


(b) Ifr > vx(p)/(p — 1), prove that the group 
{M €GL,(R) : M =1(mod p")} 


contains no elements of finite order other than the identity element. 


Let E/K and E’/K be elliptic curves defined over a local field, and let 
@: E— E’ be a non-constant isogeny defined over K. Prove that 


e(E/K)=e(E'/K), — 6(E/K) = 6(E'/K), — f(E/K) = f(E'/K). 


Notice that this generalizes the assertion [AEC, VII.7.2] that E and E’ 
either both have good reduction or both have bad reduction, since (10.1) 
says that good reduction is equivalent to ¢ = 0. 


Let K/Q3 be a 3-adic field, let E/K be an elliptic curve, let 2 # 2 be a 
prime, and let L = K(E{é]). 

(a) Prove that the first higher ramification group Gi(L/K) is either trivial 
or acyclic group of order 3. (Hint. Show that Gi(L/K) is independent of 2, 
and then take £ = 2.) 

(b) Prove that Gi(L/K) is a cyclic group of order 3 if and only if E/K 
has reduction type II, IV, IV“, or II”. 

Let K/Q2 be a 2-adic field, let E/K be an elliptic curve with potential 
good reduction, let £ > 3 be a prime, let L = K(E[é]), and let Gi(L/K) 
be the first higher ramification group of L/K. Prove that 


Gi(L/K) = {1} or Z/2Z or Z/4Z or Ha, 


where Hg is the quaternion group of order 8. (Hint. Show that Gi(L/K) 
is independent of @, and then take ¢ = 3.) 

For each of the following elliptic curves E/Q2, let L = Qe(E[3]) and 
compute the first higher ramification group Gi(L/Q2). 

(a) E:y?+2y=2°. 

(b) E:y?+2ry +8y = 2°. 

(c) E:y?+2y = 2° + 2z. 

Let K be a number field. Prove that for any constant B there are only 
finitely many elliptic curves E/K whose conductor fz x satisfies 


ING (feyx)| < B. 
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4.45. 


4.46. 


4.47. 


Let L/K be a finite extension of local fields as described at the beginning 
of §10, let mz be a uniformizer for L, and define an index function 


itjx : G(L/K) — ZU {ov}, trjK() = uL(aZ — TL). 


(a) Prove that iz/xK(o) >1+ 1 if and only if o € Gi(L/K). 
(b) Prove that iz;x(to7T~') =iz/x«(c) for all 0,7 € G(L/K). 
(c) Prove that iz/x(o7) > min{iz/x%(o),ir/x(7)} for all 0,7 € G(L/K). 
We continue with the notation from the previous exercise, with the addi- 
tional assumption that the extension L/K is totally ramified. For basic 
material on the representation theory used in this exercise, see for example 
Serre [7]. The Artin character Arz;x and the Swan character Swr/% are 
defined to be the functions 

Arik: Gry — Z@, Swrk > GiyK — Z@, 
Arig (0) =—trse(o) ifoA1, Swrx(o)=1-injw(o) ifo Al, 


Arrj« (1) = So irsx (0); Swix (1) = >_(iz/«(o) - 1). 
oFl oFAl 
(a) A function p on G(L/K) is called a class function if y(tar~') = ¥(c) 
for allo,r € G(L/K). Prove that Arz;x~ and Swz/x are class functions. 
(b) *For any pair of functions ~1, #2 on G(L/K), define 
1 -1 
(v1, b2) = CHT S| > wbi(o)pe(a”). 
o€G(L/K) 
If y is the character of an irreducible representation of G(L/K), prove that 
(x, Arty«) and (x,5wz/x) are non-negative integers. 
(c) Replace K by its maximal unramified extension, and let E/K be an 
elliptic curve with integral j-invariant. Let L/K be a finite Galois extension 
such that E has good reduction over L. Further let yg be the character of 
the representation of G(L/K) on T(E). Prove that xg takes values in Z 
and is independent of @. 
(d) *Continuing with the assumptions from (c), prove that 
6(B/K) = (Swiyx,xe) and f(£/K) = (Arr, Xe). 
Deduce that 6(£/K) and f(£/K) are integers that are independent of £. 
This proves (10.2c) in the case that EF has potential good reduction. 
(e) If E/K has non-integral j-invariant, prove that 6(£/K) and f(£/K) 
are integers and are independent of @, thus completing the proof of (10.2c). 
(Hint. For (e), use the isomorphism E(K) © K*/q® described in (V.5.3) 
and exercise 5.11.) 
Let K/Q, be a p-adic field with p > 5, let ux be the normalized valuation 
on K, and let E/K be an elliptic curve. Prove that there exists a minimal 
Weierstrass equation for E/K with a: = a2 = a3 = 0, and with aa, ae, 
and A as described in the following table: 
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4.48. Let K/Q3 be a 3-adic field with normalized valuation vx, and let E/K be 
an elliptic curve. Prove that there exists a minimal Weierstrass equation 
for E/K with a; = a3 = 0, and with a2, a4, ag, and A as described in the 
following table: 


4.49. Let K/Q, be a p-adic field with p > 3, let ux be the normalized valuation 
on K, and let E/K be an elliptic curve with Type I}, reduction. 
(a) Ifn = 0, prove that vx (j(Z)) > 0 and vx (De/x) = 6. 
(b) Ifn > 1, prove that vx (j(£)) = —n and ux (De/K) =n+6. 
(c) Let L/K be a tamely ramified extension with ramification degree e = 
e(L/K). Prove that E/L has Type Ine reduction if e = 0(mod 2), and E/L 
has Type I}, reduction if e = 1 (mod 2). 
(d) Give an example of an elliptic curve E/Qe with Type I*, reduction 
satisfying n > 1 and v2(j(E)) > 0. This shows that (b) is not true for p = 2. 
What is the largest possible value of n in this situation? 


4.50. Let K/Q3 be a 3-adic field with normalized valuation vx, and let L/K be 
a tamely ramified extension with normalized valuation vz. Let e = e(L/K) 
be the relative ramification degree, so vr = evx. Let E/K be an elliptic 
curve whose reduction type is one of II, IV, IV*, or II*. 

(a) Prove that the reduction type of E/L is given by the following table: 


Type(E/K) || e=1 (6) e = 2 (6) e=4 (6) | e€=5 (6) 
II aI IV Iv* II* 


IV IV TV" Iv. | Iv 
Iv* Iv of iv Nt IV 
[ ir * Ive [OV I 


(b) Let De;x and Dey, be the minimal discriminants of E/K and E/L 
respectively. Prove that the value of the difference 


e(L/K)ux(De/K) — vi(Deyz) 


is given by the following table: 


Type(E/K) € = 1 (6) e = 2 (6) e=4 (6) e = 5 (6) 
I 2e—2 2e—4 2e-—8 2e — 10 
IV 4e—4 de -8 4e—4 
Iv* 8e-8 | 8e-4 8e —8 
II* 10e — 10 10e—8 10e — 4 
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4.51. 


4.52. 


4.53. 


4.54. 


This exercise illustrates how wild ramification can cause the reduction type 
of an elliptic curve to change in an irregular fashion. It may be compared 
with the previous exercise, which dealt with the tamely ramified case. 

Let K/Q3 be a 3-adic field with normalized valuation vx, and let E/K 
be an elliptic curve given by a Weierstrass equation 


E:y=2' +a¢6 with v% (a6) = 1. 


Let L/K be a ramified extension of degree 3, so L/K is wildly ramified. 
(a) Prove that E/K has Type II reduction. 

(b) If vx (3) = 1 (ie., if K/Q3 is unramified), prove that E/L has Type III* 
reduction. 

(c) If vx (3) = 2, prove that E/L has Type Io (i.e., good) reduction. 

(d) If vx (3) = 3, prove that E/L has Type III reduction. 

(e) Try to find a general formula for the reduction type of E/L. Does the 
reduction type of E/L depend only on vx (3)? 


We continue with the notation from the previous exercise, so K/Q3 is a 
3-adic field and E/K is an elliptic curve 


E:y=2%+a6 with vx (a6) = 1. 


Prove that the conductor exponent of E/K is f(E/K) = 2+3vx(3). Notice 
that (10.4) says that this is the largest allowable conductor exponent for 
an elliptic curve defined over a 3-adic field. 


Let K/Q2 be a 2-adic field with normalized valuation vx, and let E/K be 
an elliptic curve given by a Weierstrass equation 


E:y? +2ry = 2° — 2? +a42 with vx (a4) = 1. 


(a) Prove that the equation given for F is a minimal Weierstrass equation 
and that vx (Dex) = 6vx (2) +3. 

(b) Prove that E has Type III reduction and that the special fiber of E 
has two components. 

(c) Prove that the conductor exponent of E/K is f(E/K) = 2+ 6vxK(2). 
Notice that (10.4) says that this is the largest allowable conductor exponent 
for an elliptic curve defined over a 2-adic field. 


This exercise asks you to verify two cases of Ogg’s formula (11.1) that were 
not completed in §11. Let K’/Qs3 be a 3-adic field, let E/K be an elliptic 
curve, and let L = K(E[Q)]). 

(a) If E/K has Type III* reduction, prove directly that L/K is a tamely 
ramified extension. Use this to verify Ogg’s formula for E/K. 

(b) Let M = K(VQ), and suppose that E/M has Type IV* reduction. 
Give a direct proof of Ogg’s formula in this situation. (Hint. Mimic the 
proof for Type IV reduction given in §11.) 


CHAPTER V 


Elliptic Curves over Complete Fields 


Every elliptic curve E/C admits an isomorphism C*/q” = E(C) by complex 
analytic functions, and we have seen amply demonstrated in Chapters I 
and II the importance of such uniformizations. In this chapter we are 
going to study uniformizations over other complete fields such as R and 
finite extensions K/Q,. We begin in §1 with a brief review of the relevant 
formulas over C, and then in §2 we use the complex uniformization to 
investigate elliptic curves over R. 

We next turn to elliptic curves defined over p-adic fields K/Q,. Tate [9] 
has shown that for every g € A’* with |g| < 1 there is an elliptic curve E,/K 
and a p-adic analytic isomorphism K*/g” ~ E,(K). In §3 we will describe 
the Tate curve E, and prove all of its main properties except for the sur- 
jectivity of the map K* — E,(K’), which we reserve for §4. Tate has also 
shown that every elliptic curve E/K with non-integral j-invariant is iso- 
morphic, possibly over a quadratic extension of K, to some Ey. We will 
prove this result and describe the necessary twisting in §5, and then in §6 
we will give some applications, including Serre’s proof that an elliptic curve 
with complex multiplication has integral j-invariant. 


§1. Elliptic Curves over C 


We have already discussed elliptic curves and elliptic functions over C in 
some detail; see [AEC VI] and (I §§5-8). The purpose of this section is to 
gather and rewrite in a convenient form the formulas we will use later in 
this chapter when we study elliptic curves over R and over p-adic fields. 
Let E'/C be the elliptic curve corresponding to the normalized lattice 


A, =Zr+Z_ for some r € H. 


We know that E(C) = C/A,, the isomorphism being given in terms of the 
Weierstrass g-function and its derivative. As in (I §6), it is convenient to 
let 

use", g=e*™", and q={q* : ke Z}. 
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Note that since Im(r) > 0, we have |g| < 1. There is a complex analytic 
isomorphism 
C/ke— ot" Jaq", zeouset™ 


and we can use the q-expansions from (I §§6,7) to explicitly describe the 
isomorphism E(C) & C*/q”. 
The elliptic curve E has the Weierstrass equation 


E : y? = 42 — go(r)x — ga(r). 
From (1.7.3.2) we have 


1 
Gnit 27 = 5 al 


1 1 
(amin 937 T) = sia 


1 + 240s3(q)], 
1+ 504s5(q)], 


where in general 


nk g™ 
x(q) = Do oe(n)a" = aa 


n>1 n>1 


(For the second equality, see exercise 5.1. Here o4(n) = >> d|n d* as usual.) 
We’ve collected the powers of 277 as indicated to make it easier to eliminate 
them. 
Next, the isomorphism 
C/e — E(C) 
u +> (9(u,q),9"(u, 4) 


is given by the power series described in (1.6.2) and (1.6.2.1): 


age hl) = qe + gy 2nla) 


neZ 
1 ; qru(1 + q”u) 
(Bmis® (Hd = aera (l—qruys 


(Note that g’ is the derivative of g with respect to z, where u = e?7**.) 
Jacobi’s formula (1.8.1) says that the discriminant of the Weierstrass 
equation for EF has the product expansion 


A(r) = g2(r)* — 2793(r)? = (2mi)'?q [[ A - 9"), 


n>1 
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and the j-invariant of E is given by the series (1.7.4) 


1 
ji(g = at 744 + 196884q +-- +S > e(n)q” with e(n) € Z. 


n>0 


ae 


It is convenient to make a change of variables, partially to remove the 
powers of 277 and partially to eliminate the powers of 2 and 3 appearing 
in the denominators of the series for go, g3, and g. Thus we let 


eee 
(2Qri)2- 12’ 

1 
ay = 2, / a 
Qae 


which gives the new Weierstrass equation 


y? tay =x" +042! +46 


with 
le 1 
Cee (anit 27) T 38° 
Eo (sa 1 
=F Rie) — Be amie) + 758° 


Now using the series for go, 93, 9, 9” and doing a little algebra, we find we 
have proven virtually all of the following result. 


Theorem 1.1. For u,q € C with |q| < 1, define quantities 


sx(q) = >. on(n)q” ye 


n>1 n>1 ci 


Mea! sage oe, 


12 
X(ua) = >> i702 — 281(q), 


neZ q u)? 
(ru)? 
Y(u,q) = 20 aa (i — grup + s81(q), 


Ey: y? +ary = 2° + a4(q)x + a6(q). 


(a) E, is an elliptic curve, and X and Y define a complex analytic isomor- 
phism . 
6: C/y — E,(C) 
U K— (X(u,q),¥(u,@)) ifu ¢ v, 
O ifu € q®. 
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(b) Written as power series in q, both a4(q) and a6(q) have integer coeffi- 
cients; that is, a4(q),ae(q) € Z[q]. 
(c) The discriminant and j-invariant of E, are given by the formulas 


A(q) = a6 + af + 72aqag — 64a3 — 43202 
q[[G-4")* € Za), 


n>1 


j(q) = 1 +744 + 196884q +--- 


+ » c(n)g” € z + Z[q]. 
n>0 q 


QlrR 


(d) For every elliptic curve E/C there is aq € C* with |q| < 1 such that E 
is isomorphic to Eq. 


PROOF. The discussion given above has proven all of (a), (b), and (c) 
except for the minor point that the power series for ag(q) has integer coef- 
ficients. Since 


‘sti _5s3(q) + 785(Q) _ 5o3(@) + 705(4) mn 


12 & 12 
5d3 + 7d 
=: x(x 12 a", 


it suffices to observe that 
5d°+7d° =0(mod 12) for all d€ Z. 


(Notice that we used this same fact in the proof of (I.7.4a).) This completes 
the proof of (c). 

Finally, to prove (d), we note that the uniformization theorem (I.4.4) 
says that every elliptic curve E/C is isomorphic to C/A for some lattice A. 
Then the change of variables used above transforms the Weierstrass equa- 
tion for & into an Eg. 

oO 


Remark 1.2. It is sometimes convenient to rewrite s;(q) in the alternative 
form 
ya 
(0) = 0 aa 
rec Cae 


To check that these two expressions for s1(q) are the same, we substi- 
tute T = q” into 


T d 1 d ea EN 
tee 


m>1 


412 V. Elliptic Curves over Complete Fields 


and sum over n > 1, which yields 


Sep ee = Dee = 


_ 
n>1 n>lm>1 m>1 n>1 m2>1 q 


For later reference, it will be helpful to rewrite the formulas 


a(z+a)a(z— a) 


9(z) — (a) = - s@o(aQ? at p'(z) =- 


from (1.5.6) in terms of X(u,q) and Y(u,q). For this purpose, we introduce 
a normalized theta function. 


Proposition 1.3. Define a normalized theta function 0(u,q) by the for- 


mula = 
O(u,q) = a-w TJ = re i ) 


2 
n>1 q”) 


(a) 0(u,q) converges for all u,g € C* with |g| < 1 and satisfies the func- 
tional equation 


a(qu,q) = ~—6(u, 4). 


(b) @ is related to the Weierstrass o function by the formula 
(ug) =~ seb) e-™9(u, 9) 
, Qri nae 


where u = e2™"*, q = e?"'7, and n(1) is the quasi-period associated to the 
period 1 in the lattice Zr + Z. 
(c) @ is related to the functions X(u,q) and Y(u,q) described in (1.1) by 
the formulas 
: 20 (ur U2, g)O(ui ug", 9) 
1 X(u — X(u ; = = 
Me aD aa) Oeasa) 
u0(u*, 4g) 
A(u,q)4 
ProoF. (a) Since |q”u| < 1 and |g”u~!| < 1 for all sufficiently large n, it 


is clear that the product defining 6 converges. Then replacing u by gu and 
renumbering the factors in the product gives 


(ii) 2Y(u,g) + X(u,q) = - 


= qrtt _ qn-1,,-1 
NCO ed ees Tee 


n>1 


2028-5 [] Hota pa se a(n, 4). 
n>1 
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(b) This is immediate from the definition of @ and the product formula 
for o given in (1.6.4). 
(c) From above, X and Y are related to g and @’ by the formulas 


1 1 1 
X = ——9- — d 2Y4+X = ——~ 9’. 
Qn” 72 * To > nays 
Hence writing u; = e277! and ug = e?""*2, we find (dropping the q from 
our notation) 


a 
(271)? 


X (ui) — X(u2) = (@(u1) — (u2)) 


_ 1 o(uyu)o(uyuy') oe y, 
~~ BaIP oft )Pa(uay? om (6) 


—e all) ((21 +22)? + (21-22)? -227 -223) 


Leo T(21+22)+(21—22) —221 — 222) : O(uyU2)O(uruz ') 
O(u)?0(u2)? 
from (b) 
ere O(uyu2)O(ur Us *) 
° O(ur)?0(u2)? 


This proves (i), and (ii) is proven by the similar calculation 


2¥ (u) + X(u) = ope) 


Oo u2 
=- gat from (I.5.6b) 


— ean l)((22)?—427) « eMi(2z—42) O(u?) 
A(u)* 


from (b) 


A(u)*” 
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The uniformization theorem (1.4.4) says that every elliptic curve defined 
over C is analytically isomorphic to C*/q” for some g = e?7'7. Since an 
elliptic curve defined over R is automatically defined over C, it has such a 
model. We begin by describing a set of 7’s which classifies elliptic curves 
over R up to C-isomorphism. 
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Proposition 2.1. Let E/R be an elliptic curve. Then there exists a 
unique T in the set 


Caditits hole ; sSo<chulo+e#ses 


such that 
j(r) = i(E). 
(The set € is illustrated in Figure 5.1.) 


Proor. First we check that j(C) CR. If r = it or 7 = 5 + it with t ER, 
then ¢ € R, so the g-expansion (I.7.4b) 


ir)=a' +> oe(n)q", — e(n) €Z, 
shows that j(7) € R. Next, for any 7 we have 


=~ (2m __ —2riltl?/r. 
gee = eo 2nilr l/s, 


a of trl 
HT) =3 ( aris 
Hence for r = e*” we have 


je?) = 9 (-=:) =Jj ((! ci) = j(e’), so j(e?) ER. 


This proves that j(C) C R. 
Next we observe that 


so in general 


‘ a _ 4: 24 ni 
a a a a 
ica! poh hs ne Ne ot -1 seme 
jim 3 (5 + it) = oo q+ > c(n)g” = —oo. 


By continuity, we conclude that 7(C) = R. (Note that 7 : C > R is 
continuous, since j7 : H — C is holomorphic.) Finally, (1.1.5b) and (1.4.1) 
imply that 7 : C — R is injective, which concludes the proof that the 
map 7: €C — R is a bijection. o 


Proposition 2.1 completely describes all C-isomorphism classes of ellip- 
tic curves defined over R. However, since R is not algebraically closed, it is 
possible to have more than one R-isomorphism class in each C-isomorphism 
class. For a given F/R, these other curves are called the twists of E 
(see [AEC X §5]). Our next result classifies these twists. 
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The Set © for Which j(r) E R 
Figure 5.1 


Proposition 2.2. (a) Let E/R be an elliptic curve. Then the C-iso- 
morphism class of E contains exactly two R-isomorphism classes. (In the 
notation of [AEC X §5], Twist((E,O)/R) = {+1}.) 

(b) More precisely, define an invariant y(E/R) € {+1} by the rule 
sign(ce), if 7 #1728 (i.e., if cg #0), 


E/R) = 
meee) oo if 7 = 1728 (i.e., if cg = 0). 


(Here c4 and cg are the usual quantities associated to some Weierstrass 
equation for F/R.) Let E/R and E’/R be elliptic curves. Then 


E2E' ovrR <> j(E)=j(E’) and 7(E/R) = 7(E'/R). 
Proor. From [AEC X.5.4], the twists of E are in one-to-one correspon- 
dence with the elements of the group 

R*/R*", where n = # Aut(E) € {2, 4, 6}. 


Since n is even, R*/R*” ~ {+1} has two elements. This completes the 
proof of (a). 
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For (b) we use the more precise description of the twist provided by 
the second part of [AEC X.5.4]. If E/R is given, and if E’/R is the twist 
of E/R corresponding to some D € R*/R*", then [AEC X.5.4] says that 


c6(E’) = D3c6(E) if j(E) 40,1728, 
c6(E’) = Deo(E) if j(E) =0, 
c4(E’) = Dea(E) if j(E) = 1728. 
Hence in all cases (E/R) 
F x 
sign(D) = ———, 
Bn?) = 87) 
so the ratio y(E/R)/y(E’/IR) determines whether E’ is isomorphic to E or 
to its non-trivial twist. Oo 


Remark 2.2.1. For an analogous result over p-adic fields, see (5.2). 


By combining Propositions 2.1 and 2.2, we can now give a convenient 
set of q’s which completely classifies all R-isomorphism classes of elliptic 
curves. For any g = e?7’7, we let E, be the elliptic curve 


E,:y? +2y = 2° + aa(q)x + a6(q), 
where a4(q) and ag(q) are the power series described in (1.1), and we let 
@:C*/q EC), ou) = (X(u, 9), ¥(u,9)), 


be the C-analytic isomorphism from (1.1). 

Theorem 2.3. Let E/R be an elliptic curve. 

(a) There is a unique q € R with 0 < |q| < 1 such that 
BE p Eq 


(i.e., E is R-isomorphic to E,). 
(b) Composing the isomorphism from (a) with the map ¢ described above, 
we obtain an isomorphism 


w:C*/q* —> E(C) 


which commutes with complex conjugation, that is, w is defined over R. In 
particular, 
wi RY /q® — E(R) 


is an R-analytic isomorphism. 


Proor. Note first that if q € R, then the series (1.1) imply that a4(q) 
and ag(q) are in R, so Ey is defined over R. We want to start by using (2.1) 
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to find a 7 with j(r) = j(£). As noted during the proof of (2.1), the 
points T = it and 7 = $ + it give real values of g. However, for T = e’® we 


do not have q € R. So we use the transformation 
-1 . 1 L 6 
a(r) = aa) which satisfies a(e®) = 5 + cot a 


Thus a € I(1) yields a bijection 


Since j(a7) = j(7), we conclude from (2.1) that there is a unique 7 in the 


set 
it: ¢t>1)>U oe pact 
at: = : = 
= grt 2 


such that j(7) = j(E). Note that in the second set we do not allow t = 3, 
since this would give the duplicate value 


feidig 3 “Ase op tas ve 
i($ +58) =slelw) = 00. 
The set of g = e?"’7 € R with 0 < |q| < 1 corresponds bijectively with 
the 7’s in the set 
: | eee 
v= fit : erobul sit : trol. 


Further, the transformations 


give identifications 


Sift :4 > 1} fits t< 1}, 


1 5 1 md 1 : } 
ar{yriete sha {Seusecah, 
; 1. 
Q@:tror~+ —2. 


2° 2 


(See Figure 5.2.) 
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A Set of + Giving All q € R with 0 < |q| <1 
Figure 5.2 


Since j(y7) = j(r) for any 7 € SL2(Z), we see from above that the 

map 
j:T—-R 

is exactly two-to-one. From (2.2a) there are exactly two R-isomorphism 
classes of elliptic curves with a given j-invariant. Hence to complete the 
proof of (a) we must check that if 7,7’ € J are distinct points with j(7) = 
j(r’), then E, and Ey are non-trivial twists of one another. To do this we 
will use (2.2b). 

The change-of-variable formulas for Weierstrass equations [AEC III §1] 
imply that 


ca(q) = u*(12go(r)) and —eg(q) = u°(216g3(7)) 


for some u € C*. (In fact, our explicit formulas imply that u = (27i)7!, 
but for our purposes it suffices to know that u does not depend on 7.) Now 
suppose that 


,__at+b 
 ert+d 


b 
T for some (: y € SL2(Z). 
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Then using the y-invariant defined in (2.2b) and the fact that go and gs 
are modular forms, we find that 


(Byl®) i (6010) — sion (99°) — sign tor a ay 
Em ~ 8 (Seay ) =e (ey) = selon +) 


if j(r) # 1728. 


Similarly, 


sy cs (Sa) ne (25 ) = sign{(er + d)"} 


if 7(7) = 1728. 
We must show that for the 7’s and 7’’s described above, all of these signs 


are —1. This requires checking several cases. 


1 
Casel: r=it,t>1. 7 =Sr=—-. 
Te 


In this case j(7) # 1728, and 
(cr +d)® = (it)® = —t® < 0. 


tod 

—2r-1 

Again we are in a case in which j(7) # 1728, and 
(cr + d)® = (27 — 1)§ = (2it)® = —64t® < 0. 


Case III: 7 =i. ee ee 


1 
Case II: Tg als te 7! = Br 


This is the case that j(7) = 1728, and 
(er + d)* = (r-1)* = (@-1)* = -4 <0. 

Hence in all cases Ey and FE, are distinct twists, which completes the 
proof of (a). 
(b) The series for X(u,q) and Y(u,q) show that if g € R, then the map 

@:C*/q? +E (C), — o(u) = (X(u, 9), ¥(u, 9g) 

commutes with complex conjugation. Since the isomorphism E = E, in (a) 
is defined over R, it follows that the composition w : C*/q” — E(C) is also 


defined over R. This proves the first half of (b), and the second follows by 
taking Gc;g-invariants of the exact. sequence 


1 — q” — C* — E(C) — 0, 


so we get R*/q” —> E(R). (Note that H'(Gcg,q*) = Hom(Ger,Z) = 
0.) Finally, it is clear that this isomorphism is R-analytic, since we know 
that w is C-analytic and is given by power series with coefficients in R. 

O 


An elliptic curve over R has either one or two components. We can 
use (2.3) to give a criterion to determine which case holds. 
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Elliptic Curves over R with One and Two Components 


Figure 5.3 


Corollary 2.3.1. Let E(R) be an elliptic curve, and let A(E) be the 
discriminant of some Weierstrass equation for E/R. Then there is an iso- 
morphism of real Lie groups 


_ ( R/Z, if A(E) <0, 
( = yay x (Z/2Z), if A(E) > 0. 


ProoF. Fix an isomorphism EF =/g Eq as in (2.3a). Then 


ul? A(E) = A(E,) = [[(-4")* 
n>1 
for some u € R, so 
sign A(E) = sign A(E,) = signg. 


Now (2.3b) says that E(R) & E,(R) & R*/q”, so the following isomor- 
phisms complete the proof of Corollary 2.3.1: 

log |u| 
log |q| 


i 1 
R*/q’ —>R/Z, urs S ( — sign(u) + 1) (modZ), if q <0, 


log |u| 
log q 


(modZ), sign(u)), if g > 0. 
Oo 


R*/q” > (R/Z) x {+1}, ure ( 
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We conclude our discussion of elliptic curves over R by describing the 
Weil-Chatelet group WC(E/R). (See [AEC X §3] for basic facts about the 


Weil-Chatelet group.) As in (2.2), the fact that Gc,p is so small leads to 
a very simple answer. 


Theorem 2.4. Let E/IR be an elliptic curve, and let A(E) be the dis- 
criminant of some Weierstrass equation for F/R. Then 


0 if A(E) <0, 


WOR) = re if A(E) > 0. 


Proor. From [AEC X.3.6] there is an isomorphism 

WC(E/R) = H'(Geyr, E(C)). 
Choose a q € R and an R-isomorphism E = g Ey as in (2.3). Then 

E(C) = C*/q" as Gc/r-modules, 

so we have an exact sequence 

0 — q* — C* — E(C) — 0 
of Gcr-modules. Since 

H'(Gcyr,C*) =0 


from Hilbert’s Theorem 90 (or by an easy direct calculation), the long exact 
sequence in Gc/r-cohomology gives 


0 — WC(E/R) — H? (Gor, q”) — H?(Geyr,C’). 


Now Gc;g = {1,¢} is cyclic of order 2, so for any Gcyr-module M 
(written multiplicatively) we have 


~ {2eEM : o(x) =x} 
H?(Gcyr, M) & ‘{a-o(2):2€ My" 


(This is a special case of a general formula for the cohomology of cyclic 
groups. See exercise 5.2.) Since Gc,g acts trivially on q@, we find 


H? (Ger. @) = #/"”, 


ay AO RE yam ie 
H? (Gcyr,C*) = Tl? pu e Gh} ~ R*/R*?. 
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Hence we finally obtain an exact sequence 
0 — WC(E/R) — q°/¢°* — R*/R”, 


where the right-hand map is induced by the natural inclusion q7 ~~ R*. 
From this exact sequence we immediately conclude that 


WC(E/IR) = 0 if¢g <0, 
~ ee ifg>0. 


But as we observed during the proof of (2.3.1), 


ul? A(E) = A(q) =4 [[a Sages for some u € R, 


n>1 


so sign A(E) = signg. This completes the proof of Theorem 2.4. al 
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We have seen that every elliptic curve defined over the complex numbers 
has a parametrization C/A for some lattice A C C. Suppose we replace C 
by Q, and endeavor to parametrize an elliptic curve E/Q, by a group of the 
form Q,/A. Unfortunately, this approach immediately fails, because Qp 
has no non-trivial lattices. Indeed, if A C Q, is any non-zero subgroup 
and 0#teéA, then 


p't€ A foralln >0O and lim p"t=0, 


n—- Co 


so 0 is an accumulation point of A. Hence Q, contains no discrete subgroups 
other than 0. 

Tate’s idea is to first exponentiate, which leads to the alternative de- 
scription C*/q” for elliptic curves over C. Now the analogous situation 
over Q, is much more promising, since Q has lots of discrete subgroups. 
For example, any q € Q5, with |g| < 1 defines the discrete subgroup 


g ={q" :nEZycQ. 


Further, the series described in (1.1) will converge in Q, and give a p-adic 
analytic isomorphism of the quotient Q7/ q with a certain elliptic curve E,. 
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The situation is nicely summarized by the following picture (taken from 
Robert [1, II §5]). 


Complex case p-adic case 


C/A no p-adic analogue 


tial : : 
pees no exponential available 


C*/q@ Q* /q” : p-adic elliptic curve. 


More generally, we can work over any p-adic field K, by which we mean 
a finite extension K/Q,. All of these facts (and more) are contained in 
the next theorem, the proof of which will keep us busy for the next two 
sections. 


Theorem 3.1. (Tate) Let K be a p-adic field with absolute value | - |, 
let q € K* satisfy |q| < 1, and let 


kan 
= TF, aula) =—5s5(q), g(a) = SOF sla) 


be the series described in (1.1). 
(a) The series a4(q) and ag(q) converge in K. Define the Tate curve E, by 
the equation 

E,: y? +2y = 2° + a4(q)x + a6(q). 


(b) The Tate curve is an elliptic curve defined over K with discriminant 


A=q]JG—4")* 


n>1 


and j-invariant 


' 1 1 
j(Eq) = © + 744 + 196884q ++ = = + VV e(mg 
n>0 


where the c(n)’s are the integers described in (1.1). 
(c) The series 


X(u,q) = ier me 2s1(q), 


neZz 


Yu) = {Po +0), 


3 
néeZ q ”u) 
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converge for allu € K, u¢ q”. They define a surjective homomorphism 


g: ke E,(K) 


u we E(X(u,9,¥(u,g)) ifudd’, 
O ifueq. 


The kernel of @ is q”. 
(d) The map ¢ in (c) is compatible with the action of the Galois group 
Gx x in the sense that 


d(u’) =¢(u)” foralluc K*,0€ GkyK: 
In particular, for any algebraic extension L/K, ¢ induces an isomorphism 


6: Lt /q? ~ E,(L). 


Remark 3.1.1. The p-adic uniformization described in Theorem 3.1 is es- 
pecially useful for arithmetic applications because it is compatible with the 
action of Galois as described in (3.1d). Note that a complex uniformization 
C/A — E(C) or C*/q” — E(C) does not have this compatibility (except 
relative to Gc;g), since in general one cannot apply an element of Galois 
to the value of a convergent series by applying it to each term of the series. 
(See exercise 5.8.) 


Remark 3.1.2. Theorem 3.1 is actually true for any field K that is com- 
plete with respect to a non-archimedean absolute value. The only time we 
will use the fact that K is a finite extension of Q, will be in the proof that 
the map ¢ in (3.1c) is surjective. (In fact, we will really only need the fact 
that the absolute value is discrete, so our proof actually is valid somewhat 
more generally, for example over the completion of Q>*.) For a proof of 
Theorem 3.1 in the most general setting, using p-adic analytic methods, 
see Roquette [1]. 


PROOF. (a) From (1.1b), the series defining a4 and ag are in Z[q], so they 
will converge in K for any value of g € K satisfying |g| < 1. 
(b) The discriminant of Eg is 
A(q) = —ag + a3 + 72a4a6 — 64a} — 4322. 
Substituting in the series for a4(q) and ag(q), we find the usual power series 


A(q) = q — 24q? + 2524? + --- = q (mod q’). 


Hence |A(q)| = |q| # 0, so Ey is a non-singular elliptic curve. 
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Next we observe from (1.1c) that the identity 


A(q) =a] [@-4")* 


n>1 


holds for all g € C with |g|.. < 1, where for the moment we write | - | 
for the usual absolute value on C. It follows that this identity is true as 
an identity of formal power series in Z[q]. Hence it remains true when we 
take q to be an element of absolute value less than 1 in any field that is 
complete with respect to a non-archimedean absolute value. 

Finally, the formula 


i(@) = (1 + 48a4(q))° 
A(q) 
_ 1+ 240g + 21609? + --- 
"gq — 24g? + 25293 + 


1 
=(1 + 744q + 196884q” + ---) 
q 


holds in the non-archimedean case, since it is obtained formally by taking 
the quotient of the appropriate power series. 

(c) We begin by rewriting the series for X and Y as follows, where we’ve 
used the alternative expression (1.2) for s1(q): 


qu q” 
X(u,q) = Gay Hes = (1 —q-"u)? SG a) 
gru-! q” 
ge 2 Aes Ver ayaa 
(q” u)? (q7"u)? g” 
Hi) = Ga pst 2 cera cms Goes t Oey) 


qru7} q” 
=u eared mus G-quatys Teroak 


These expressions show immediately that X(u,q) and Y(u,q) converge for 
allu € K*\q%. (Note that although K itself is not complete, every term in 
these series is in the field K(u) = Q,(u,q), which is a finite extension of Q,; 
so we are really working in the complete field K(u). Similar comments will 
apply below whenever we speak of substituting elements of K into a series.) 
The functional equations 


X (qu,q) = X(u,q) = X(u7',@) 
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are now obvious, the first equality from the original series for X and the 
second from the rearranged series we just gave. A little algebra gives similar 
functional equations for Y, 


¥(qu,q) =¥(u,q) and Y(u~!,q) = —-¥(u,q) — X(u,q). 


If we restrict u to the range |g| < |u| < |g|~1, we have |g”u| < 1 
and |g”u~'| < 1 for all positive integers n. So we can use the expansion 


a para Gs ae =) > mq@’T)” = (mr) a4 


n>1 n>lm>1 d>1 mld 


with T equal successively to u, u~!, and 1 to rewrite the series for X as 


X(u,q) = me +O (Somu +uT ™ —2))qt € Q(u)lal, 


d>1 mld 
valid for |g| < |u| < |g|~?. 


A similar calculation allows us to write Y as a power series in q with 
coefficients in Q(u): 


(m—1)m um — m(m + 1) —m d 
Y(u,q) = oe 310 iar ag +m})q 
€ Q(u)[q], valid for |g| < |u| < |q\7’. 


We begin our proof of (c) by showing that the image of the map ¢ is 
contained in the curve E, given by the Weierstrass equation 


Ey: y? +ry = 2° + a4(q)z + a¢6(q). 


This amounts to showing that when we substitute the series X(u,q) and 
Y(u, Le. for x and y in this equation, we get an identity valid for all u € 
K* \ q@. By the periodicity of X and Y, it is enough to consider values 
of u such that |q| < |u| < 1 and u #1. In this range we can use the above 
formulas which express X and Y as power series in q with coefficients that 
are rational functions of u. Thus we will be done if we can show that the 
equation 


Y(u,q)? + X(u,q)¥(u,q) = X(u,q)? + a4(q)X(u,q) + a6(4) 
is valid as a formal identity in the ring of formal power series in q with 


coefficients which are rational functions of the indeterminate u. In other 
words, we want to verify that this identity holds in the ring Q(u) [gq]. 
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From (1.1), we know that this equation is true numerically if we sub- 
stitute any pair of complex numbers u,g € C in the domain of conver- 
gence |gloo < |uloo < lg|ai, u # 1. If we fix some u with |gloo < |uloo <1 
and let q vary, we conclude that the resulting power series in g with com- 
plex coefficients are equal coefficient-wise. Then letting u vary, we deduce 
that the coefficients are formally equal as rational functions of wu. Hence 
we have an equality of formal power series in Q(u)|q]. 

Next we prove that ¢ is a homomorphism. Given u;,u2 € K*, we 
put uz = u,u2g and must prove that 


P3 = P, + P, where P; = (ui), = 1, 2,3. 


In view of the periodicity ¢(qu) = ¢(u), we may restrict consideration to 
values of u; and ug in the ranges 


lq| < Jui] <1 and 1 < Jug| < |q|~*, which means that |q| < Ju3| < |q|~!. 


Then all three u; are within the domain of convergence of the power series 
expressions for X,Y € Q(u) [gq] described above. 

Since ¢(1) = O by definition, the relation P; = P, + P2 holds triv- 
ially if wu; = 1 or wg = 1. Using the functional equations for X(u~',q) 
and Y(u~',q) and the fact that P, + Pp = O if and only if 2; = x2 
and yi + y2 = ~21, it is also not hard to verify that P3; = P, + P 2 in the 
case that uiug = 1. So we are reduced to the case that P,, P2, and P3 are 
all different from O. We write P; = (2;,y;); that is, we set 2; = X(uj, q) 
and y; = Y(uz,q) for i = 1, 2,3. 

Suppose first that 2; # x2. Then writing out the addition law on E,, 
we see from [AEC, III.2.3] that the relation P, + Po = P3 is equivalent to 
the two identities 


(x2 — ©1)?ax3 = (yo — 91)? + (yo — 1) ("2 ~— 21) — (v2 — 21)*(21 + 22), 
(x2 — t1)y3 = —((y2 — yi) + (@2 — 21) x3 — (yit2 — yor). 


Now we can argue as above that (1.1) implies that these identities hold for 
all complex numbers w1, ue, g in the specified ranges. Hence they are iden- 
tities in the ring Q(u1, u2)[¢] of formal power series in g with coefficients 
that are rational functions of wu; and ug, and so are true for uw), u2,q € K. 

To deal with the remaining case 7, = £2, we could use the duplication 
formula, or we could invoke a p-adic continuity argument, but perhaps the 
simplest solution is to observe that 21 = £2 if and only if P, = +P2, and 
then use the following lemma. 


Lemma 3.1.2. Let ¢ be a map of a (multiplicative) group into an (addi- 
tive) group which takes on an infinite number of distinct values and satisfies 
the identity 


P(uiu2) = O(u1) + O(u2) — whenever ¢(ui) # (uz). 
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Then ¢ is a homomorphism. 


PROOF. Given any wu; and we, the fact that ¢ takes on infinitely many 
distinct values means that we can choose a u such that 


o(u)  +6(u1), o(u) # —$(u1) + P(ua), b(u)  £¢(usua). 
Then ¢(uu1) = o(u) + d(u1)  +G(uz), so 
$(u) + (ur) + G(u2) = (uur) + O(u2) = O(uuiue) = o(u) + P(uru2). 


Canceling ¢(u) gives @(u1)+¢(u2) = (uiu2), valid for all u; and wz, which 
shows that ¢ is a homomorphism. Oo 


To finish the proof that our ¢ : K* — E,(K) is a homomorphism, 
we need merely observe that ¢ certainly takes on infinitely many distinct 
values. For example, the series for X(u,q) shows that for any t € K 
with |¢| < 1, we have |X(1 + t,q)| = |t/~?. Hence (3.1.2) applies in our 
case. 

So we now know that ¢ is a homomorphism of K* into E,(K). That 

the kernel of ¢ is g@ is apparent from its very definition. It remains to 
prove that ¢ is surjective. This is the hardest part of the proof, which we 
will leave to the next section. 
(d) As noted above, the series for X(u,q) and Y(u,q) converge in the 
complete field K(u), so it really suffices to prove (d) for ¢ € Gr, where L 
is any finite Galois extension of K containing K(u). Any such o maps the 
maximal ideal of the ring of integers of L to itself, so o will preserve the 
absolute value on L: 


ja?| =|a| for allo € Gz/x and allae L. 


It follows easily from this that if }> a; is a convergent series with a; € L, 
then ()lai)” = Jaz. (See exercise 5.8.) Applying this to the series 
for X(u,q) and Y(u,q), we deduce that ¢(u)? = ¢(u’). This proves the 
first part of (d). 

For the second part, we use (c) to produce the exact sequence 


Lesg? sk eee 07g pee 


We now know that the maps in this exact sequence commute with the 
action of Gz. Hence for any algebraic extension L/K, we can take Gg 1, 
invariants of this short exact sequence to obtain the exact sequence 


Lg Sy, 


To obtain surjectivity on the right, we observe that it suffices to prove 
surjectivity in the case that L is a finite extension of Q,; we will prove 
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exactly this fact in the next section. Alternatively, we may observe that 
the next term in this last sequence is the cohomology group H!(Gz /L> q’). 
Since q € K, the action of Gg ;; on ¢ is trivial, so this is just the group 
of continuous homomorphisms from the profinite group Gx, to the dis- 
crete group gq4 & Z. The only such homomorphism is the trivial one, 
so H'\(Gx,/1,,q") = 0, which proves that L* + E,(L) is surjective. This 
completes the proof of (d). Oo 


Before resuming the proof of Theorem 3.1 in the next section, we 
briefly pause to repeat Proposition 1.3 in the context of p-adic theta func- 
tions. These formulas will be used in our study of local height functions in 
Chapter 6. 


Proposition 3.2. Define a function 0(u,q) by the formula 


_ gn ce Mal 
o(u,q) =(1- uw) [J SE. 


n>1 ( 


(a) 0(u,q) converges for all u,q € Q with |q| < 1 and satisfies the func- 
tional equation 


(qu, 4) = ~—0(u, 4) 


(b) @ is related to the functions X (u, q) and Y(u,q) described in (3.1c) by 
the formulas 


(i) X(u1,q) — X(u2,q) _ _ w26(ur ua, g)0(urug', 4) 


A(u1, q)?O(u2, 4)? 
" u6(u?, q) 
2Y XxX = -— 
PRoor. The convergence of the infinite product defining @ is clear, and the 
functional equation follows formally by substituting qu for u and renum- 
bering. This proves (a). Next we observe from (1.3) that the two for- 
mulas in (b) are valid over C. Now an argument similar to that used to 
prove (3.1c) shows that they are valid over K. We will leave the details to 
the reader. oO 
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84. The Tate Map Is Surjective 


The final step in the proof of (3.1), which we postponed from the previous 
section, is to show that the map 


o:K*—+ EK), — o(u) = (X(u,), ¥(u,9)), 


is surjective. One approach is to reprove in the p-adic case some classical 
results from complex analysis concerning Laurent series. In particular, one 
proves Schnirelmann’s Theorem that a Laurent series f(X) which converges 
for all X 4 0 can be written as a convergent product 


f(X) =ex* J] - =) Il (1-*), 


lol<l ja|21 


where the product is over all roots a of f. Then one constructs the field 
of p-adic meromorphic functions on K*/g” and shows using Riemann-Roch 
that it is a field of genus 1 over K.This leads to an isomorphism with some 
elliptic curve, and after some work with the classical power series for ¢ 
and g’, one deduces that the elliptic curve is indeed the curve we have 
denoted E,. For details of this line of proof, see Robert [1], Roquette [1], 
and Tate [9]. We will take a more computational, geometrically inspired 
approach. However, we should note that the theory of p-adic analytic 
functions has many important applications in modern arithmetic geome- 
try. The reader might consult Bosch-Giinter-Remmert [1] for a thorough 
introduction to this subject which is called rigid analysis. 

In order to prove that ¢: K* — E,(K) is surjective, we need to show 
that for any given point P € E,(K) there is some u € K* with ¢(u) = P. 
But the point P will be defined over some finite extension of K, so it suffices 
to prove that ¢: L* — E,(L) is surjective for all finite extensions L/K. In 
fact, this is even stronger than the original statement of (3.1c), although 
it is precisely the result we needed to complete the proof of (3.1d). For 
notational simplicity, we will write K in place of L, so we are reduced to 
showing that for any finite extension K/Q, and any q € K with |q| < 1, 
the map ¢: K* — E,(K) described in (3.1) is surjective. 

We also set the following notation which we will use for the remainder 
of this section: 


R the ring of integers of K, 

mM the maximal ideal of R, 

T a uniformizer for R, Mt=7R, 

k the residue field of R, k = R/M, 


ord, the normalized valuation ord, : K* — Zon K. 
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The group E,(K) admits the usual filtration (see [AEC, VII §2]) 
E((K) > Eqo(K) > Eq1(K), 


where 


Eqo(K) = {P € E(k) : 
Egi(K) = {P € E,(K) : 


P € Eqgns(k)}, 

PaO}: 

Here E,/k is the reduction of E, modulo 9, and Eqns are the non-singular 
points on E,. From [AEC, VII.2.1] and [AEC, VII.2.2] we have isomor- 
phisms 


Eqo(K)/Eqi(K) = Eg, Lane(h) and Eqi(K) = E(m) 
PHP =(z,y)> = 


where FE is the formal group of E [AEC, IV §1]. 
Similarly, the quotient group K*/q” has a natural filtration 


K* /q¢® > R* > Ri, 


where 
= {ue R:u=1(mod M)} 


is the group of 1-units in R. There are also isomorphisms 


R*/R*=k* and Rt =G,(M) 


area url—u, 


where G,, is the formal multiplicative group [AEC, VI.2.2.2]. We are going 
to prove not only that the map ¢: K*/q” — E,(K) is an isomorphism but 
that it respects the filtrations we just described. 

We begin with the formal groups. First, from the formula for X(u, q), 
it is clear that 


u = 1(mod MM) => ord, (X(u,q)) <0, 


since only the term will be non-integral. This proves that 


(l—u)? 
@( RI) C Eg i(K). 


Next we show that this inclusion is an equality. 
Using the isomorphisms described above, we look at the map 


~ 


Em(M) > RE Eqi(K) => = E,(m) 
; X(1+t,q) 
Y(1+t,q)° 
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Note that as sets, G, (90) and E,(M) are just the set 92; they merely have 
different group structures attached to them. If we substitute u = 1+¢ into 
the series for X(u,q) and Y(u,q) and expand as Laurent series in t, we find 
that 

Kitt, = (1 +> pet) and Y(1+t,q)= i3(14 se Bat”). 

m>1 m>1 

with coefficients Qm,3m € R. By taking the ratio of X and Y, we are 
reduced to showing that if y1,y2,... € R, then the map 


yw: MN — M, tr t(1+ > mt”) 
m>1 
is surjective. This follows immediately from [AEC, IV.2.4], which asserts 
the existence of a power series \(T) € R[T] satisfying (A(T)) = T (ie., v 
is surjective, since for any w € I we have \(w) € M and ¥(A(w)) = w.) 
Next. we look at the behavior of ¢ on R*. If we take the series 
for X(u,q) and reduce it modulo IN, we see that 
u 
= ——— ll : 
X (u,q) Gouer (mod Mt) for allu € R*, 
so $(R*) C Eqo(K). Since o(R}) = Ey1(K) from above, we get a well- 
defined injective homomorphism on the quotient groups 
ke RRL —*) Byo(K)/Eya(K) = Eqna(k) 
ie Sora See 
(CouPe' Cass 
This map k* > Eqns(k) is clearly surjective, the inverse being 
2 
y 
(x,y) i? 73” 


U 


so the map on the quotient groups is an isomorphism. (Note that Ey has 
the equation y? + zy = z°.) Then the commutative diagram 


1 — RI — RR —- k* — 1 


\ \¢ [ 
0 — Eqi(K) — Eyo —- Eqns(k) — 0 


implies that the map ¢: R* —> E,,o9(K) is an isomorphism. 
We are left to show that the injective homomorphism 


g: K*/R*q? — E,(K)/Eqo(K) 
is surjective. The group on the left is easy to describe, since the map 
K*/R*q? —> Z/ord,(qgZ 
u — ord, (wu) 


is clearly an isomorphism. So the following proposition will complete the 
proof of (3.1). 
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Proposition 4.1. 


#Eq(K)/Eqo(K) < ordy(q). 


To prove this estimate, we will use geometry to divide Ey(K) into 
several subsets, and then we will show that these subsets actually corre- 
spond to the cosets of E4,9(K) inside E(K). We start with an elementary 
characterization of the points in FE, 9(K). 


Lemma 4.1.1. Let P = (x,y) € E,(K). The following are equivalent: 
(i) P€ Ey 0(K), 

(ii) je) od, 

(iii) fy 21. 


Proor. Taking partial derivatives of the equation for E,, we observe that 
P € Eqo(K) => |y—3a?—a4)>1 or |2y+a2)>1 


<> max{|y — 32x7|, |2y+2|} >1 since ja4| = |q| <1 
=> max{|z|,|y|} > 1. 


Suppose first that |z| >1 > |y|. Then 
|x|? = |y? + xy — aga — a6] < max{|y|?, |zyl, |aaz|, lao] } < max{1, |x|}. 


This strict inequality is a contradiction, so |x| > 1 implies |y| > 1. Similarly, 
the assumption |y| > 1 > |z| gives the contradiction 


ly|? = |x? + aga + ag — xy] < max{1, |yl}, 
so |y| > 1 implies |z| > 1. This proves that 
max{|z|, |y|} 2 1 + |x| > 1 and |y| > 1, 


so (i) implies (ii) and (iii). 

Conversely, if P ¢ Ego(K), then P = (x,y) reduces to the singular 
point (0,0) of E,(k), so |z| < 1 and |y| < 1. Hence either of (ii) or (iii) 
implies (i), which completes the proof of the lemma. oO 


Next we use similar criteria to partition the points of E,(K) that do 
not lie in Ego(K). 


Lemma 4.1.2. Let P = (x,y) € Eq(K) \ Eq,o(K). Then exactly one of 
the following three conditions is true: 

(i) 1>|y| > |a+ yl, in which case |y| > |q|?, 

(ii) 1>|x+y| > |y|, in which case |x + y| > |q|?, 
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(iii) [yl = ja + y| = [a]? 
(Note that (iii) can only occur if ord,(q) is even.) 


PROOF. Let 
n = min{ord, z, ord, y} and N = ord, q. 
Dividing the equation for E, by x2”, we obtain the equation 
ye + 2nYn = ie + 2~"a4(q)tn + 77" a6(q), 


where 
LIn=n "c£ER and YIn=n "YER. 


(In fancy terminology, we’ve blown up the scheme E,/Spec(R) to find 
the affine subscheme on which P lies.) Since |a4(q)| = Jae(q)| = lq], it is 
immediate from this equation that 7~?"ag(q) € R, so 


1 1 1 
l<n< 3 ody ae(q) = 3 orde (9) = aN. 


(The fact that 1 <n comes from (4.1.1).) 
We row consider two cases. First, ifn < 5N , then reducing the above 
equation modulo a gives 


ye + 2nYn = 0(mod 7). 
This means that either 
mn =O(mod7), or Ynx+2%n =O0(mod zt), or both. 
But they cannot both be zero, since otherwise 2, = yn = 0(mod 7), which 


would contradict the definition of n. Hence one of the following two asser- 
tions is true: 


(i) Yn #0(mod 7) and yy + tp = 0(mod 7), 

(ii) Yn = 0(mod zm) and yn + tpn #0 (mod zr). 
These correspond to (i) and (ii) in the statement of the lemma. For exam- 
ple, for (i) we find that 


4 Th 
yl = le" yn| = [aI > Igl? = and jy +2] = |a"(¥m +2n)| < [zI" =Iyl- 


It remains to deal with the case n = ZN. Since ag(q) = —q+-°--, our 
equation becomes 


ye + 2nYn = (-g/7?") = 0(mod 7). 
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Hence |yn| = |Yn + 2n| = 1, which implies that 
1 1 
lyl = ly +2] = [a|” =| ]2% = |ql?. 
Oo 
Lemmas (4.1.1) and (4.1.2) allow us to divide E,(K) into the following 
subsets: 


Eyo(K) = {(x,y) € Eq(K) : |x| > 1 or |y| > 1}, 
Un = {(z,y) € Eq(K) : |x|" = |yl > |x + yl}, 
Vn = {(t,y) € Eq(K) : |x|” = |x+y| > lyl}, 
W = {(x,y) € Eq(K) : ly] = |e + yl = Iai? }. 


Notice that (4.1.2) says that U, and V, are empty unless n < 4 ordy q, 
so E,(K) can be written as the union 


E(K)=Eg(K)UWU [J (U,UV,). 


l<n<$ ord, q 


Further, if ord, q is odd, then W = @. So we have partitioned E,(K) into 
(at most) ord,(q) pieces. The final step in the proof of Proposition 4.1, 
which will also complete the proof of Theorem 3.1, is to show that these 
subsets are the cosets of Eg,o(K) in E,(K). More precisely, it will suffice 
to show that two points in the same subset are in the same coset, since 
this will imply that the number of cosets is no larger than ord,(q). This is 
exactly what we do in the following lemma. 


Remark 4.1.3. A more intrinsic explanation for the above decomposition 
of E,(K) is that the subsets U,, V,, and W are neighborhoods of the 
non-identity components of the special fiber of the Néron model of Eq 
over Spec(R). This decomposition may be compared with the description 
of special fibers of Type I, in (IV §8), Tate’s algorithm (IV.9.1), and the 
discussion in (IV.9.6). 


Lemma 4.1.4. Let P,P’ € E,(K) be points satisfying any one of the 
following conditions: 


(i) P,P'’€Un; (ii) P,P/€Va; (iii) P,P’ EW. 
Then P — P! € Eqo(K). 


PROOF. The proof of this lemma is completely elementary, although some- 
what computationally involved. We merely have to combine the geometric 
description of U,, Vn, and W with the algebraic formulas giving the group 
law on Eg. 

If P = P’, there is nothing to prove. Assume for now that also P # 
—P’. (We'll deal with P = —P’ at the end.) Then writing P = (z,y) 
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and P’ = (2’,y’), the addition law on Ey, and a little algebra yield the 
formula 

_wtytrytety) 


x(P — P’) Gann cag 


In all three cases, (i), (ii), and (iii), we have |z| < 1 and |2’| < 1, so (4.1.1) 
and the formula for z(P — P’) gives 


P—P'€Eqo(K) => |x(P-P’)|>1 
<=> lyty’+2'|-lyt+rty'| > le—2'/?. 


Suppose first that P,P’ € U,. Then 
jz] =|y|=|r|" and |z"| = |y'|=|n|", so |x —2"| < |x|”. 
On the other hand, since |y| = |x|" and |y’ + 2’| < ||", we get 
lyt+y!+2"| = |r|"; 
and similarly |y’| = ||" and |y + z| < |x|", so 
lytaty'| =n”. 
Therefore 
lyty te2'|-lytaoty'| = lal?" > \2—2')?, so P—P’ € Eyo(K). 
This proves (i). 


The proof of (ii) can be done in a similar fashion, but it is even easier 
to observe that 


PeU, = -P€EYV,. 
This follows from the formula —(zx, y) = (x,y — x). Hence (i) implies (ii). 
Further, since U,, and V, are disjoint, we see that P # —P’ in cases (i) 
and (ii). 
We turn now to case (iii), which is the most difficult. We claim that 
in this case we have 


yterty|=lyty +2. 
To see this, we note that 


1 1 
|x| < max{|z+y|,|yl} = lal? and |x| < max{|x’ + y’I, |y'l} = lal?: 
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then we compute 


lytaty/)y—(yty+e')y'| 

= |(y? + ay) -(y? +2'y')| 

= |(x? + ayx + ag) — (c”? + aga’ + ag)| 

= |e —2"|- |x? — a2’ +2" +a4| 

< |x —2'|\g|_ since |x|, |x"| < |q|?, |aal = [al 

=|yt+et+y)-(+y +2’) - Ia 

<max{lytae+y'|,lyt+y’ +2'|} - Iq 

=max{|(yt+ety')yl, |y ty’ +2')y'|}- lal? 
since |y| = ly’| = |al?, 

< max{|(y+2+y’)y|,|\(y+y! + 2')y'|} 

since |g| < 1. 


The only way that this strict inequality can possibly be true is if 
lytoty')yl =|yty' +2')y’l. 
Further, we know that |y| = |y’|, so we have proven our claim 
lytaty'|=lyty' +e’. 
Using this equality, we compute 
Jz—2'P =|(ytaty')—-(yty' +2’)? 


2 
<max{ly+z+y'|,lyty’ +2} 
=lytaty'|-lyty' +2. 


From above, this inequality implies that P — P’ € Eqo(K). 

Finally, we must deal with the case P’ = —P € W, so P — P’ = 2P. 
We could argue by continuity, but here is a direct argument using the 
duplication formula, which on E, reads 


xt — 2agx? — 8agx + az 746 f(z) 


P— P’) =2(2P) = ——_—__ Es : 
a jstlee) 4x3 + x? + daar + dag g(x) 


Here f(x) and g(x) are the indicated polynomials. From general principles, 
one knows that f(x) and g(x) are relatively prime in K[z]. More precisely, 
if we let 


F(x) = 48a +8r+64a4—1 and G(x) = 122°—x?—20a4r +2a4—108a6, 
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then a little algebra suffices to verify the relation 
f(x) F(a) — g(x)G(a) =A, 


where A is the discriminant of the Weierstrass equation for E,. Substitut- 
ing f(x) = x(2P)g(z) into this relation gives 


g(x) {x(2P) F(x) — G(z)} =A. 
We are assuming that P € W, so 


2 
|9(z)| = |2y + 2]? < max{|y|, ly +2]}° = lal, 
|G(x)| = |1223 — 2? — 20agz + 2a4 — 108ag| < max{|z|”, lal, |ae|} = ld\, 
| F(a) | = |48a? + 8a + 64a, —1| = 1, 
|A| = |q — 24g? + 25293 —---| = |q]. 


Hence A 
1< Fol = |x(2P)F (2) — G(z)| 


< max{|x(2P)| - |F(2)|, |G(z)|} 
< max{|z(2P)|, |q|}. 


Since |g| < 1, it follows that |x(2P)| > 1, so 2P € Eq,o(K) from (4.1.1). 
This completes the proof of Lemma 4.1.4, and with it the proofs of Propo- 
sition 4.1 and Theorem 3.1. 

oO 


§5. Elliptic Curves over p-adic Fields 


In the previous two sections we have shown that for any p-adic field K/Q, 
and any q € K* with |q| < 1, the quotient group K*/q” is (analytically) 
isomorphic to an elliptic curve E,(K ). In the analogous situation over 
the complex numbers, we know (1.1d) that every elliptic curve E/C is 
isomorphic to EK, for some g € C*. However, in the p-adic case we have 


1 i 
|j(Eq)| = 7 t 744 + 196884q ++] = > 1, 


lal 


so it is clear that not every elliptic curve over K can be isomorphic (over K) 
to an E,. A necessary condition is | j (E)| > 1. We begin by showing that 
this condition is also sufficient. 
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Lemma 5.1. Let a € Q» be an element with |a| > 1. Then there is a 
unique q € Q) with |q| < 1 such that j(E,) = a. This value of q lies in 


Qp(a). 


Proor. The j-invariant of E, is given by the series (3.1), which we write 
as 


: 1+ 744q + 196884q? + --- 
i(Q) = — 
q 
The reciprocal of this series, which we will call f(g), is given by the formula 


fq ==> = q 
VY 5g) 14-744q + 196884q2 + --- 


= q — 744q” + 356652q° — --- € Z[q]. 


Applying [AEC, IV.2.4] to the series f, we get a series g(q) = q+--- € Z[q] 
such that 9( f (q)) = q as formal power series in Z[gq]. Since g has integer 
coefficients and leading term q, it will converge if we evaluate it at any 
element @ € Q, of absolute value less than 1 and will satisfy |g(@)| = |4]. 
In particular, since |a| > 1, we find that 


a=9(+) € O(a) 


satisfies 


1 1 1 1 
la| =| i@ fMN=fi{9lo : 
Hence j(q) = @ as desired. This proves the existence part of (5.1). 
To prove uniqueness, suppose that j(q) = j(q’) with |q| < 1 and |q’| < 
1. Then f(q) = f(q’), so 
0= |f(a) — f(a’)| 
= |q—4'|-|1 —744(q +9’) + 356652(q? + qq’ +9”) + ---| 
=l|q-4'l.- 
Therefore q = q’. oO 
Before proving the p-adic uniformization theorem, we describe an in- 
variant which is useful for studying the twists of a curve. 
Lemma 5.2. Let E/K be an elliptic curve defined over a field of char- 


acteristic not equal to 2 or 3, and choose a Weierstrass equation 


y? + a,ry + azgy = ge + agu” + a4x + ag 
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for E/K. Let cq and cg be the usual quantities [AEC, III §1] associated to 
this equation. Assuming that j(E) # 0,1728, we define 


y(E/K) = —ca/eg € K*/K*?. 


(The reason for the negative sign will become apparent later when we prove 
that y(E,/K) =1.) 

(a) y(E/K) is well-defined as an element of K*/K*?, independent of the 
choice of Weierstrass equation for E/K. 

(b) Let £’/K be another elliptic curve with j(E') # 0,1728. Then E 
and E’ are isomorphic over K if and only if 


j(E) =j(E') and y(E/K) = ¥(B"/K). 


(c) Let E/K and E'/K be elliptic curves with j(E’) = j(E) # 0,1728, 
and suppose that y(E/K) #4 7(E’/K), so 


_.( EH 
as (, “(E'/K) 


is a quadratic extension of K. Let 
xi GRRE Oo Gik Sey 


be the quadratic character associated to L/K. Then there is an isomor- 
phism 
y:E— E’ 


with the property that 


WP?) =x(o)¥(P) for allo € Gg x and all P € E(K). 


Proor. (a) The condition j(£) 4 0, 1728 is equivalent to cy #4 0 and cg # 
0, so y(E'/K) exists. If we choose a new Weierstrass equation for E/K, then 
the new cq and cg are related to the old ones by the formulas utc = C4 
and u®c, = ce for some u € K*. (See [AEC, III Table 1.2]. The fact 
that u € K follows from [AEC, III.3.1].) Hence 


/ 

Cc C4 C4 
4aiy2F=at (mod Ke), 
ro C6 C6 


which proves that y(£/K) is independent of the chosen Weierstrass equa- 
tion. 
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(b) If E and E’ are isomorphic over K, then [AEC, III.1.4b] asserts that 
j(E) = j(E’). Further, since the Weierstrass equations for E and E” are 
Weierstrass equations for the same elliptic curve over K, it follows from (a) 
that 7(E/K) = 7(E’/K). 

Conversely, suppose that j(E) = j(£’) and 7(E/K) = 7(£’/K). Since 
the characteristic of K is not 2 or 3, we can find Weierstrass equations for EF 
and E’ over K of the form 


E:y=2°+Ac+B, E':y=ae°+A'r+B’, 
with A, B, A’, B’ € K. The fact that j(F£) = j(E’) 4 0, 1728 implies that 


a renee 4A3 
Be = Be’ since j(E) = 1728773 o7B? 


Similarly, since cy = —48A and cg = —864B, our assumption 7(E/K) = 
(E'/K) means that 
2A _ 4 


B=a = —7(E/K) = -y(E'/K) = z => 


so there is some t € K* such that AB’ = t?A’B. Using these relations 
between A,B and A’, B’, it is now easy to check that the map 


E—E’',  (z,y)-> (t?2,t°y) 


is a K-isomorphism. 
(c) We take models for E/K and E’/K as in (b), and again the assump- 
tion j(E) = j(E’) ¥ 0,1728 implies that 43B’? = A” B3. Next we let 


Oy cy eee 2 WE/K) #2 
t= ces V AB’ so ¢t = (EK) (mod K**). 
Since 7(E/K) 4 7(E’/K), we know that L = K(t) is a quadratic extension 
of K; and as in (b), the map 
bi E— E',  (a,y)+—> (ta, ty) 


is easily seen to be an isomorphism. Finally, for any o € Gg;%, we know 
that t? = x(o)t. So for P = (z,y) € E(K) we have 


W(P)? = V(2,y)" = (Pa, By)? = (x(0)?t727, x(0)?t3y7) 
= (Pet aay) = via ary) = x(o)(P2). 


We are now ready to prove Tate’s p-adic uniformization theorem, which 
applies to all curves whose j-invariant has absolute value greater than 1. 
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Theorem 5.3. (Tate) Let K be a p-adic field, let E/K be an elliptic 
curve with |j(E)| > 1, and let 7(E/K) € K*/K*? be the invariant defined 
in (5.2). 

(a) There is a unique q € K* with |q| < 1 such that E is isomorphic over K 
to the Tate curve E,. Further, this value of q lies in K. 

(b) Let q be chosen as in (a). Then the following three conditions are 
equivalent: 


(i) | E is isomorphic to E, over K. 
(ii) y(E/K)=1. 
(iii) E has split multiplicative reduction. 


PrRooF. (a) From (5.1) there is a unique g € K* with |g] < 1 such that 
j(Eq) = j(E). This implies [AEC, III.1.4b] that Ey is isomorphic to E 
over K, which completes the proof of (a). 

(b) From (5.2) we know that E is isomorphic to E, over K if and only 
if j(£) = j(E,) and 7(E/K) = 7(E£,/K). So in order to prove that (i) 
and (ii) are equivalent, we must show that y(£,/K) = 1. Using (3.1) we 
find that the cq and cg values associated to the Tate curve 


Eq: y? + ry = 2° + a4(q)z + a6(q) 


are 
ca(q) = 1 — 48a4(q) = 1 + 240s3(q), 


e6(q) = —1 4+ 72a4(q) — 864a6(q) = —1 + 50455(q). 


So the y-invariant of E,/K equals 


ca(q) _ 1+ 240s3(q) 


*2 
BQ. tna, Oe 


y(Eq/K) = — 


To see that 7(E,/K) is a square, we use the following elementary calcula- 
tion which implies that c4(q) and —cg(q) are themselves squares in K. 


Lemma 5.3.1. Let a€ K with |a| <1. Then 1 + 4a is a square in K. 


PROOF. We first observe that the binomial coefficient 


n n} 4n n 


eer (33) (3) CAR) = a (2) 
is an integer divided by 4”. Hence the coefficients of the series 


(1-+4a)7t = +S Salon = ye Cran 


n=0 
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are integers, so the series converges in K. Therefore (1+4a)~! is a square 
in K, so the same is true of 1+ 4a. This completes the proof of the lemma, 
and with it the fact that (i) and (ii) are equivalent. oO 


Next we note that since |a4(q)| = |ae(q)| = |q| < 1, the equation of 
the reduced curve Ey is 


E,:y +2y=2', 


which clearly has split multiplicative reduction. This shows that (i) im- 
plies (iii). 

Conversely, suppose that & has split multiplicative reduction. We will 
show that 7(£/K) = 1, which will prove that (iii) implies (ii). Take a 
minimal Weierstrass equation for E, 


E:y? +azy + agy = 2° + aga? + age + ag. 


Making a linear change of variables, we may assume that the singular point 
modulo 2M is the point (0,0), where as usual we write IN for the maximal 
ideal of the ring of integers of K. Then the fact that (0,0) is on the curve 
and singular modulo IN implies that 


a3 = a4 = ag = 0(mod M), 
and hence that 
bs = a1a3 + 2a4 =0(mod Mt) and cy = b3 — 24b4 = b2 (mod M). 


From [AEC, VII.5.1b], the fact that E has multiplicative reduction 
implies cs # 0 (mod Mt), so we see that bo 4 0(mod M). It follows that be 
is a unit (i.e., [bo] = 1). Hence 


a | 1 ~ 2478 
y(E/K) =-— = 5 Sere TRENT (mod K**), 
6 7 \ 1-365 +2164 
b3 b3 


Applying (5.3.1) to the numerator and denominator of the bracketed frac- 
tion on the right-hand side of this equation, we find that 


1 
V(E/K) = a= b. (mod K*?), 
2 
It remains to show that if the multiplicative reduction of E is split, then be 


is a square in K*. 
Note that the reduction of EF is 


BE :y? + acy = 2° + don”. 
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We factor the polynomial 
y? + dry — Gar” = (y — &x)(y — Ba). 


The fact that EF has multiplicative reduction means that E has a node, 
so & # @; and the fact that the reduction is split means that @ and # are 
actually in the residue field of K, rather than in a quadratic extension. 
(See [AEC, III §1, VII §5].) It follows from Hensel’s lemma applied to the 
polynomial T? + a,T — ag that & and @ lift uniquely to elements a, 3 € K 
such that 

y? + airy — apx = (y — ax) (y — Ba). 


Hence 
by = a? + dag = (—a — B)? + 4(—aB) = (a — B)? € K*?, 
so 7(E£/K) = be =1 (mod K?*). 


We have now proven (ii) <=> (i) ==> (iii) => (ii), which completes 
the proof of Theorem 5.3. oO 


Suppose that we have an elliptic curve E/K as in Theorem 5.3 with 
invariant y(E/K) #1. IfweletL=K (Vx(E7K)), which is well-defined, 


since 7(E£/K) is defined up to squares in K, then it is clear that y(E/L) = 
1. Applying (5.3) to E/L, we find that E is isomorphic to E, over L, so 


E(L) & E,(L) = L*/q*. 
We will now describe E(K) in terms of this identification. 


Corollary 5.4. With notation as in the preceding paragraph, 


E(K) © {ue L*/q@ : NE(u) € q2/q22}. 


ProoF. First we observe that the norm map N¥ is a homomorphism 
NE . L* /q® cae K*/q"", 


so NX-(u) is well-defined modulo q?4. Applying (5.2c) to E and Ej, there 
is an isomorphism = _ 

v: E(k) — E(k) 
satisfying (P’) = x(o)¥(P)° for all o © Gg;x, where x : GxsK > 
Gi/K — {+1} is the quadratic character associated to L/K. On the other 
hand, the isomorphism ¢ : K*/q” — E4(K) is defined over K, which means 
that ¢(P’) = ¢(P)”. We look at the composition 


Lt /q? 2+ E,(L) *» E(L), 
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which we know from above is an isomorphism of groups. Let 7 © Gx K 
be an element with y(7) = —1, so 7 represents the non-trivial element 
in Gy/x. Then for any ue€ L*, 


(pod)(u) € E(K) => ¥(d(u))" = ¥(9(u)) 
<=> —v((u7)) = 4(¢(u)) since x(r) = —1 
<== ¥(¢(u-7)) = o(¢(u)) 
since —~(P) = y(—P) and —¢(u) = ¢(u7') 
<=> u 7 =u(mod dg”) 
since ¢ and w are isomorphisms 


=> ult ¢’. 


Since u!+7 = N(u), this completes the proof of the corollary. Oo 
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As we have seen amply demonstrated, the arithmetic properties of the 
torsion points on an elliptic curve are of fundamental importance. In the 
case that the curve has a p-adic uniformization, E(K) = K*/q”, it is easy to 
describe the torsion subgroup of FE. Further, since the p-adic uniformization 
commutes with the action of Gg/x, it is similarly easy to describe the 
action of Gz x on the torsion subgroup of E. We will not prove the most 
general theorem in this direction but will be content with the following 
fundamental result. (See also exercise 5.13.) 


Proposition 6.1. Let K bea p-adic field with normalized valuation ord,, 
let E/K be an elliptic curve with |j(E)| > 1, and let £ > 3 be a prime 
not dividing ord, j(E). Then there is an element o in the inertia subgroup 
of Gx; which acts on the ¢-torsion subgroup E|é] of E via a matrix of 


the form (4 1): In other words, there is a basis P\, P2 € El] such that 


PY =P, and Ps =P,+ Po. 


(One sometimes says that o acts as a transvection on E|£].) 


Remark 6.1.1. Recall that there is an ¢-adic representation [AEC, III §7| 


pe: GayK — Aut (Ty(£)). 


446 V. Elliptic Curves over Complete Fields 


Since E[é] = T)(£)/€T;(E), another way to state (6.1) is that relative to 
an appropriate basis, there is ao € Gg satisfying 


pto)= (4) (mod 0), 


Remark 6.1.2. It is also worth pointing out that the proof of Proposi- 
tion 6.1 does not need the full strength of Theorem 3.1. Specifically, we 
only need to know that the map ¢ : K*/q” —> E,(&) is an injective ho- 
momorphism; we do not need to know that it is surjective. The reason 
injectivity suffices is that we are really only interested in the torsion sub- 
group of E,, and a simple count shows that the there are m? points of 
order m in K*/q”, so we get (essentially for free) that ¢ is an isomorphism 
on torsion. 


PROOF. First we observe that if L/K is a finite extension of degree prime 
to @, and if (6.1) is true for E/L, then it is true for E/K. This follows from 
the equality ord, j(Z) = ew/, ord, j(E), where w is the extension of v 
to L, and the ramification index e,,/, is prime to @, since it divides [L : K]. 
Hence will not divide ord, j(£), so there is ao € Gg/p, C Gg, that 
acts as a transvection on E[é]. 

From (5.3b) we know that E is isomorphic to a (unique) Tate curve E, 
over an (at most) quadratic extension of K. So replacing K by this ex- 
tension, it suffices to prove (6.1) for E,, where q € K*. Similarly, we may 
assume that K contains a primitive ¢**-root of unity ¢, since the degree 
of K(¢)/K divides €— 1, so the degree is prime to @. 

Let Q =qt € K bea fixed £*"-root of g. Since ord, j(Eq) = — ord, q 
is not divisible by @, the Kummer extension K(Q)/K is totally ramified 
of degree @. Hence there exists a o in the inertia subgroup of Gg, such 
that Q? = CQ. We claim that this is the desired 0; it remains to pick the 
right basis for E,[é]. 

To do this, we use the p-adic uniformization (3.1) 


o: K*/q? — E,(K). 
With this identification we clearly have 
&: (6% -Q")/q" > E, IE. 

Further, the p-adic uniformization map ¢ commutes with the action of 
Galois (3.1d) (ie., ¢(P”) = O(P)7), so the action of Gg/x on E,[é| is 
the same as its action on the quotient group (¢” - Q”)/q”. As our basis 
for E,[¢|, we take the elements P, = ¢(¢) and P2 = $(Q). Then 

PY = $(¢)? = 667) = (0) = Pi, 

PZ = 6(Q)" = $(Q7) = $6Q) = 4() + OQ) = Pit Pr, 


We next observe that Proposition 6.1 remains true for certain elliptic 
curves over number fields. 
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Corollary 6.2. Let K/Q be a number field, let E/K be an elliptic 
curve, and assume that the j-invariant of FE is not in the ring of integers 
of K. Then for all but finitely many primes ¢, the image of the ¢-adic 
representation pe: Gz; — Aut(T;(E)) contains an element satisfying 


lo) = (4 1) (mod 


relative to a suitable basis for Te(F)/¢Te(E) = E[é]. 


Proor. Let v be a (finite) place of K for which j(£) is non-integral, 
so \j(E)|, > 1. Let Gz, [K, © Gx be the decomposition group of v 
for the extension of v to K corresponding to some ee K — Ky. 

Now (6.1) gives an element ¢ € Gg, x, which acts like (6 t) on E{é]. 
But with our identifications, ¢ € Gg,;q and E{é] C E(K) C E(Ky), which 
gives the desired result. Oo 


It is a legitimate question to ask why one should care that Gal(K/K) 
contains an element that acts on E[é| as a transvection. One answer is that 
this puts severe constraints on the allowable maps between such elliptic 
curves. For example, we will now give Serre’s p-adic proof that an elliptic 
curve with complex multiplication has integral j-invariant. (For alternative 
proofs of this important fact, see [AEC, exercise 7.10] and (II §6).) 


Theorem 6.3. Let K/Q be a number field, and let E/K be an el- 
liptic curve whose j-invariant j(E) is not in the ring of integers of K. 
Then End(£) = 


Proor. (Serre) We begin by recalling that there is a representation of the 
endomorphism ring of E [AEC, III §7], 


End(£) — End(Ty(E)), = w-> we. 


Further, we proved in [AEC, V.2.3] that for any 7% € End(£), this repre- 
sentation can be used to compute the degree of w via the formula 


deg(y) = det(we). 


(This result appears in [AEC] in the chapter on elliptic curves over finite 
fields, but the proof depends only on the non-degeneracy of the Weil pair- 
ing, which is valid in general.) 

Let ~» € End(£) be an isogeny. Taking a finite extension of K if 
necessary, we may assume that ~ is defined over kK. This means that 


V(P?)=W(P)? for all » € Gx x and all P € E(K). 


448 V. Elliptic Curves over Complete Fields 


We need to show that w € Z. 

Let 

m = deg(1 + w) — deg(w) — 1. 

Notice that if we knew that ~ was in Z, then m would equal 27. So we 
will try to show that m = 2w by showing that the degree of m — 2y) is 0. 

Using (6.2), choose a “large” prime £, an element o € Gg/,, and an 
ordered basis {P;, P2} for E[é| so that relative to this basis, 

ee ge 

pe(o) = G i) (mod £). 


(We will see below that any @ larger than deg(m— 2) will suffice.) Looking 
at the action of 7 on E[é], we find that ~ is represented by a matrix 


he & a) (mod &) 


for some a,b,c,d € Z/€Z. In other words, o(P1) = aP; + cP2 and w(P2) = 
bP, +dP2. Now, since ~ and o commute in their action on E'(K), it follows 
that their matrices commute in End(E[é]) & GL (Z/éZ): 


11 a b\_fa b 1 1 
(oi) (E a)=(% a)(o 1) ome. 
Multiplying this out, we find that a = d and c = 0, so 
_fa ob 
v= (5 p) (mod £). 


Next we determine the relationship between a and m: 
m = deg(1 +) — deg(w) — 1 by definition of m 
= det(1 + we) — det(ye) —1 from [AEC, V.2.3] 


= l+a b 2 a b\ 
= det ( 0 es) act (4 2) 1 (mod @) 


=2a (mod £). 
This now allows us to compute the degree of m — 2w, at least modulo @. 
deg(m — 2) = det(m — 2H) from [AEC, V.2.3] again 


=aet (4 my -2(6 2 (mod £) 


=O (mod £) since m = 2a(mod £) from above. 


We have now proven that deg(m — 2W) = 0(mod £) for all primes @ 
such that (6.2) is true, which means that deg(m — 2y) = 0(mod @) for 
all but finitely many @’s. Hence deg(m — 2) = 0, som = 2y. But 
every endomorphism is integral over Z [AEC, III.9.4], so m must be even 
and w € Z. This completes the proof that End(£) = Z. oO 
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EXERCISES 
5.1. Let qg eC, |g| <1, and let k € R. Prove that 


nk gq” 
1—q’ 


se(q) = > o%(n)q” = 


n>1 n>1 


where as usual o%(n) = doa. 
d|n 


5.2. Let G be a cyclic group of order n, let o be a generator for G, and let M 
be a G-module. 
(a) Prove that 


{x EM :x2-ox=0} 


2 LO 
vol (G,M) = {e@+on+---+o0%-!z : xe M} 


(This piece of elementary group cohomology is used in the proof of (2.4).) 
(b) Prove that 


{zEM:a2+o0n+-:-+o0"'x =0} 


+ ~ 
AGY MN {e-—or:2¢€M} 


Use this directly to show that for q € R*, the Weil-Chatelet group 
WC(E,/R) & H* (Ger, C*/q’) 


has order 1 (respectively 2) if g < 0 (respectively q > 0.) 


5.3. Let E/R be an elliptic curve, let A(Z) be the discriminant of a Weierstrass 
equation for F/R, and let m be an even integer. 
(a) Prove that 


Z/mZ, if A(EZ) <0, 
E(R)[m] = 

(Z/2Z) x (Z/mZ), if A(E) > 0. 
(b) Consider the Kummer sequence 


0 — E(R)/mE(R) — H'(Geyr, E[m]) — WC(E/R) — 0. 


If A(E) < 0, prove that all three terms are 0. If A(E) > 0, prove that the 
sequence is 


0 — Z/2Z — Z/2Z x Z/2Z —+ Z/2Z — 0. 
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5.4. 


5.5. 


5.6. 


V. Elliptic Curves over Complete Fields 


(a) Let a,b € R, and let F/R be the elliptic curve 
E:y? =2° + ax’ + be. 


If A(E£) > 0, so in particular if b < 0, prove that WC(E/R) has order 2, 
and its non-trivial element is represented by the homogeneous space 


C : w? = 4b2? — (1+ a27)?. 


(Hint. See [AEC X.3.7].) 
(b) Let E/R be an elliptic curve with A(Z£) > 0. Exercise 5.3(a) says 
that E[2] C E(R), so we can factor 
4x? + box” + Qbax + be = 4(a — €1)(x — €2)(a — €3) 
with e; < €2 < e3. 

Prove that the non-trivial element of WC(E/R) is represented by the ho- 
mogeneous space 

C: —w? = 1+ 2(2e2 — e1 — e3)z? + (e1 — €3)? 2". 


Let ER be an elliptic curve, and choose g € R, 0 < |g| < 1, so that j(E) = 
j(Eq). Suppose, however, that E is not R-isomorphic to E,. Then if we 
consider the isomorphisms 


C*/q° > E,(C) > EC), 


the second map will not be defined over R. Prove that with this identifica- 
tion, 
E(R) = {ue C*/q” : |ul? € q”/q""}. 


For this problem we will write E(r) for Eg with q = e?"*, and for a 


given £'/R we write E* for the non-trivial twist of &. (See Proposi- 
tion 2.2(a).) 
(a) Prove that 


Bit)’ x E(>), for all t > 0, t #1, 

di Gee dd 

hg Os aes for allt >0,t £1, 
HG 5) kata) gaa 


1 i 
i\X = = 
E(i) = ECS +5). 


(b) Fix E/R with j(E) # 1728. From (2.3) there is a unique t > 0 so that 


Elit), if A(E) > 0, 
BX 
mL e(het! , if A(B) <0. 


Let y(E/R) = sign ce(E) be the invariant defined in (2.2b). Prove that 
y(E/R) = sign(1 — t). 
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5.7. 


5.8. 


5.9. 


5.10. 


5.11. 


Let K/Q, be a finite extension with ring of integers R and normalized 
valuation ord,. Let g € K* satisfy |q| < 1, let Eg be the corresponding 
Tate curve, and let ¢ : K* — E,(K) be the homomorphism described 
in (3.1). Prove that for every r > 1, ¢ induces an isomorphism 


co) : R; ead Eqr(K), 
where 
Rr = {ue K* : ordy(u— 1) >r}, 
Eqr(K) = {(2,y) € Eq(K) : ordy(x) < —2r} U {O}. 
(a) Let L/K/Q, be a finite tower of fields with L Galois over K, let a; € L 


be a sequence of elements such that the series }> a; converges, and let o € 
Grx. Prove that 


co foe) 
o 
O as) = y ay. 
i=l i=1 


(b) Show that (a) is true if we replace Q, by R and take K = R and L = 
C. 

(c) Find a sequence of elements a; € Q such that }> a; = /2, and deduce 
that there is an element o € G@ gq such that 


Thus (a) is not true if we replace Q, by Q. 

Fill in the details needed to rigorously prove the formulas (i) and (ii) in 
Proposition 3.2(b). 

Let K be a p-adic field, let q,q’ € K* satisfy |g] < 1 and |q’'| < 1, and 
let Eq and Ey be the corresponding Tate curves. 

(a) If Eq and E,’ are isogenous, prove that there are positive integers m,n 
such that q” = q’”. 

(b) *Conversely, if g’” = q’” for some integers m,n > 1, prove that Ey 
and Ey are isogenous. (Clearly, there are homomorphisms from K*/q 
to K*/ a". for example u — u™. What is unclear is that the corresponding 
homomorphisms E, — E,, are given by rational functions, rather than by 
power series.) 

Let K be a p-adic field, and let E/K be an elliptic curve such that |j(E)| > 
1. Let y(E/K) be the invariant described in (5.2), and consider the field 
Wek ( “{ETR) ). 

(a) Prove E/K has split multiplicative reduction if and only if L = K. 
(b) Prove E/K has non-split multiplicative reduction if and only if L/K 
is unramified of degree 2. 

(c) Prove E/K has additive reduction if and only if L/K is ramified of 
degree 2. 

(d) Let Eo(K) be the group of points whose reduction is non-singular. 
Describe the quotient group E()/Eo(K) in each of the three cases (a), (b), 
and (c). 
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5.12. 


5.13. 


5.14. 
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(a) Prove that Lemma 5.2 is still true if K has characteristic 3. 
(b) Show that if K has characteristic 2 and E/K is an elliptic curve 
with j(Z) # 0, then y(£/K) is always equal to 1. Hence Lemma 5.2 
is not true in characteristic 2. 
Let K be a p-adic field, and let E'/K be an elliptic curve with split multi- 
plicative reduction. 
(a) Prove that for each prime £ # p there is an exact sequence of Gg/x- 
modules 

1 — Tr(h) — Te(E) — Z, — 0, 


where T;(2) is the Tate module of K (see [AEC, III.7.3]) and Gz, acts 
trivially on Ze. 

(b) Prove that there is a basis for Ty(E£) so that the image of the inertia 
group of K/K in Aut(Te(E)) & GLa(Ze) is equal to 


{(6 ) € GL2(Ze) : orde(b) > orde(vx (5). 


(c) Prove that the exact sequence in (a), considered as a sequence of Gx ;x- 
modules, does not split. 

Let E,/K be a Tate curve over a p-adic field K. 

(a) Prove that there is an exact sequence 


0 — WC(E,/K) car H? (Gz/x.7) >. H? (Ga/x, K*) : 


(b) Prove that 
H? (Gx x9’) & Hom (Gz/x,Q/Z) & Hom (K*,Q/Z). 


(Hint. For the second isomorphism, use local class field theory.) 
(c) It is well known from local class field theory Serre [4] that there is an 
isomorphism H?(GRK: K*) =Br(K) & Q/Z. Prove that the map 


Hom (K*,Q/Z) = H? (Gxyx.°) — H? (Gyn, K*) = Q/Z 


obtained by composing this isomorphism with the maps from (a) and (b) 
is given by the rule f +— f(q). 
(d) Deduce that 


WC(E,/K) = Hom (E4(K), Q/Z) . 


In other words, the Weil-Chatelet group of E,/K is dual to the group of 
rational points E,(K). 

(e) *More generally, prove that WC(E/K) = Hom (E(K),Q/Z) for any 
elliptic curve E/K satisfying |j(E)| > 1. 

(In fact, it is true that WC(E/K) & Hom(E(K),Q/Z) for all elliptic curves 
over p-adic fields, not just those with non-integral j-invariant. The proof 
of this result, which is due to Tate [5,6], requires different methods than 
those used in this chapter. See also Milne {1]) 
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5.15. 


5.16. 


Let K/Q, be a p-adic field, let E/K be an elliptic curve, let N > 5 bea 
prime not equal to p, and suppose that there is a point P € E(K) of exact 
order N. 

(a) Prove that E has either good or multiplicative reduction. 

(b) Let E — E’ be an isogeny of elliptic curves whose kernel is the cyclic 
subgroup generated by P (ie., E’ = E/ZP). Prove that 


uK(De/xK) a Bl vK(De'/K) =0 (mod N+ 1). 


Here ux : K* — Zis the normalized valuation on K, and Dex and De/K 
are the minimal discriminants of E/K and E’/K respectively. (Hint. Take 
Tate models Eg and Ey and look at the isogeny Ey > Ej.) 


Let E/K be an elliptic curve defined over a number field, let N be a 
prime, and suppose that there is a point P € E(K) of exact order N. Use 
the previous exercise and Szpiro’s conjecture (IV.10.6) to prove that N is 
bounded by a constant that depends only on the field K. This approach 
to proving the boundedness conjecture [AEC, VIII.7.7] is due to Frey. 


CHAPTER VI 


Local Height Functions 


The canonical height function 
h: E(K) — [0, co) 

is a quadratic form whose value at a point P measures the arithmetic 
complexity of P. The importance of the canonical height stems from the 
fact that it relates the geometrically defined group law to the arithmetic 
properties of the algebraic points on E. See [AEC VII, §9] for details. 

Recall that the ordinary height of a non-zero point P € E(K) (relative 
to the function x) is defined as a sum of local terms, one for each absolute 
value. Thus 


h(P) = aK KO] so ny, max{—v(x(P)),0}. 


veEMK 
The canonical height is then the limiting value of these ordinary heights, 
- ; 1 
h(P) = lim ja h([n|P). 
It is natural to ask whether the canonical height itself can be natu- 
rally decomposed as a sum of quadratic forms, one for each absolute value 
in K. The answer is “no,” but Néron and Tate have shown that there is a 


decomposition into local functions which are almost quadratic. Precisely, 
they show that for each v € Mx there is an almost quadratic function 


Ay: E( Ky) \ {0} —R 
such that 
A(P) = ar) a) S> mAv(P) for all P € E(K) < {O}. 


veEMxK 

In §1 of this chapter we will use an averaging argument to prove the 
existence of local height functions for all absolute values v € Mx, and in 
§2 we will prove that the canonical height is equal to the sum of the local 
heights. It is also of interest to have explicit formulas for the local height 
functions, so we will give such formulas for archimedean absolute values in 
§3 and for non-archimedean absolute values in §4. For further information 
about local height functions, see for example Lang [3] and Zimmer [2]. 
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§1. Existence of Local Height Functions 


Let K be a field and let | - |, be an absolute value on K. The absolute 
value can be used to define a topology on K in the usual way: a basis of 
open neighborhoods around an element a € K is the collection of (open) 
balls 

U.={BEK : |B-aly <e}, alle > 0. 


Let E/K be an elliptic curve. In a similar way we can define a topology 
on E(k). 


Definition. The v-adic topology on E(K) is defined as follows. For a 
point Po = (xo, yo) € E(K), a basis of open neighborhoods of Po consists 
of the sets 


U. = {(2,y) € E(K) : |x — 2oly < € and |y — yo| < e}, alle > 0. 


For the point O € E(K) at infinity we take as a basis the open neighbor- 
hoods 


U. = {(2,y) € E(K) : |z|p > e7'} U {O}, alle > 0. 


(Notice that for the neighborhoods of O there is no need to require both |z|,, 
and |y|, to be large. The Weierstrass equation ensures that they simulta- 
neously go to oo. For an alternative definition of the v-adic topology, see 
exercise 6.1.) 

In this section we will prove the existence of almost quadratic local 
height functions. These will be certain continuous functions 


E(K) \{O} =R, 


where E'(K) \ {O} is given the v-adic topology induced from E(k), and R 
is given its usual topology. The following formulation is due to Tate. 


Theorem 1.1. (Néron, Tate) Let K be a field which is complete with 
respect to an absolute value | - |,, and let 


v(-) = — log] - |e 


be the corresponding additive absolute value. Let E'/K be an elliptic curve. 
Choose a Weierstrass equation for E/K, 


E:y? +ary + a3y = 2° + agx* + a4z + a6, 


and let A be the discriminant of this equation. 
(a) There exists a unique function 


A: E(K)\~{O} —R 
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with the following three properties: 


(i) X is continuous on E(K)\{O} and is bounded on the complement 
of any v-adic neighborhood of O. 
(ii) The limit 
Jim {A(P) + 5u(x(P))} 
SO 
exists. 
(iii) For all P € E(K) with [2|P £0, 


([2|P) = 4A(P) + v((2y + ai@ + a3)(P)) — f0(A). 
(b) A is independent of the choice of Weierstrass equation for E/K. 


(c) Let L/K be a finite extension and w the extension of v to L. Then 
(with the obvious notation) 


Aw(P) =A,(P) for all P € E(K) \ {O}. 


Definition. The function A described in Theorem 1.1 is called the (local) 
Néron height function on E associated to v. 


Remark 1.1.1. For other properties of A which are equivalent to (iii), see 
exercises 6.3 and 6.4. 


PrRooF (of Theorem 1.1). (a) We begin with uniqueness. Let 
A,r’: E(K) . {O} —R 


be two functions satisfying (i), (ii), and(iii); and let A = \ — » be their 
difference. From (ii) we see that the limit 


po 


exists; so if we define A(O) to be this limiting value, then (i) implies that 
A: E(K) —R 
is a continuous bounded function on all of E(k). 
Next we observe from (iii) that A((2]P) = 4A(P) provided [2]P 4 O. 
But the points satisfying [2]P = O form a discrete subset of F(A) (notice 


there are at most four such points), so by continuity A([2)P) = 4A(P) 
holds for all P. Iterating this relation N times and dividing by 4% gives 


A(P) = pro" P), valid for all P € E(K) and all N > 1. 
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Since A is bounded, we may let N — co to deduce that A(P) = 0. 
Hence \ = ’, which proves uniqueness. 

Before proving the existence of A, which is more complicated, we will 
prove (b) and (c). 

(b) It is clear that conditions (i) and (ii) are independent of the choice of 
Weierstrass equation. Since the quantity 

(2y +a,z + a3)4 

A 

is invariant under change of coordinates (see [AEC, III §1]), we see that (iii) 
is likewise independent. Hence 4, if it exists at all, does not depend on the 
Weierstrass equation. 
(c) Since A, satisfies conditions (i), (ii), and (iii) for E(L) and w, and 
since w restricted to K equals v, we see that »,, satisfies (i), (ii), and (iii) 
for E(K) and v. By the uniqueness already proven, the restriction of A» 
to E(K) equals A,. 

We turn now to the proof of existence. Property (ii) says that A should 
look like du oa !, at least close to O. Of course, this is no good for points 
with 2(P) close to 0, since away from O, \ is supposed to be bounded. So 
as a first guess for \ we might try the function 


Ai(P) = 3 max{v(2(P)~"), 0}. 


It turns out that 1 almost satisfies property (iii). We first need to make 
this precise, and then we will modify A, to produce X. 
Let 


o(x) = x* — bax? — 2bex — be, 

W(x) = 42° + box? + 2bgx + bg = (2y + a,x + a3)? 
be the usual functions on E, so the duplication formula [AEC III.2.3(d)] 
reads OP) 

z(2P) = ——~. 
OP) = oP) 
Define a function 
1 1 
f(P) = Aa ([2])P) — 41(P) — hacace)) + qv) 
for all P € E(K) with [2)|P 4 O. 

Notice that if A; were to satisfy (iii), then f would be identically 0. We 


are going to show that f is bounded. 
Using the definition of A, and the duplication formula, we can rewrite f 


max{|¢(P)|.,,, eo} | ae 
max{|2(P)|*,1} ; 


v? 


as 


f(P) = 5 log ( (A). 


(Recall that v(t) = —log |t|,.) A priori, f is defined at all points P € E(K) 
with [2])P 4 O. The following crucial result shows that more is true. 
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Lemma 1.2. With notation as above, f extends to a bounded continuous 
function on all of E(K). 


PRoor. Let 
max{|¢(P)|,,/(P)|,,} 


re = a 
max{|x(P)|,,1} 


7 


so f = }log(F)+4v(A). Clearly, F is continuous on E(K)< {O}. Further, 
as P — O, we find 


lim. F(P) = max{|O(P)],1o(P)],} = 
PO |az(P)|y—00 |z(P)|, 


Hence F’ extends to a continuous, bounded function on E(K). 
Since the limit equals 1, we also see that there is a constant c, > 0, 
depending on the chosen Weierstrass equation, such that 


|x(P)|, >o = F(P)> 4 


So in order to prove that log(F’) is continuous and bounded on E(k), we 
are reduced to showing that F' is bounded away from 0 on the set |z|,, < cy. 
From the definition of F’, we must show that there is a constant cg > 0 so 
that 


max{|@(P)|,,,|v(P)|,} 2c2 for all P € E(K) with |x(P)|, < cr. 


The polynomials ¢(x) and w(x) are relatively prime in A [a]. We give 
two quick proofs of this fact. 
Proof 1 (theoretical): The map [2]: & — E has degree 4 [AEC III.6.2(d)]. 
From the commutative diagram 


[2] 
— 


EB EB 
Poy 
pl ole Pt 


the rational map ¢$/7 has degree 4, so ¢(x) and w(x) have no common 
roots. 


Proof 2 (computational): An explicit computation shows that 
Resultant (¢(x), ¥(x)) = A?, 


where A is the discriminant of the given Weierstrass equation. Since A # 0, 
we see that $(x) and w(x) have no common roots. 
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Therefore we can find polynomials 6, ¥ € K[z] satisfying 
$(2)®(2) + (x) ¥(2) = 1. 


Evaluating this identity at 2 = x(P) and using the triangle inequality yields 
the desired result: 
1 < |o(x) O(2)|, + |Y(z)¥(z)|, 
< 2max{|(0)|,,}0@)],} -max{](0)|,.|%@)],} 
< ey' max{|¢(z)|,,|v(x)|,, } for |x|, <¢. b 
According to Lemma 1.2, the “naive” local height function 1 satisfies 

condition (iii) up to a bounded function. The next proposition shows how 
to decompose such a bounded function into a difference of two functions. 


Then these new functions will be used to modify A; so as to make (iii) hold 
exactly. 


Proposition 1.3. (Tate) Let 
f:E(K)—R 


be any bounded continuous function. Then there exists a unique bounded 
continuous function 
uw: E(K)—R 


such that 


f(P) = 4u(P) - 2([2]P) for all P € E(K). 


Proor. If the function s exists, then we can use its defining relation N 
times to compute 


MP) = 5 F(P) + 5H(21P) 


=1)P)+ Leap) + daca) 


16 


1 


= perl (OP) + aee(2™1P). 


n=0 


Since yw is supposed to be bounded, if we let N — oo, then the last term 
should disappear. So we define yw by the formula 


Coo 


wP) = 7 af (RP) 


n=0 
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and verify that it has the required properties. (Notice that if u exists, it 
must be given by this formula, so we get uniqueness for free.) 

First, since f is a bounded function, it is clear that the series is ab- 
solutely convergent, so yz is well-defined. But more is true. Each of the 
functions 

0 [2"|: E(K) —R 


is bounded and continuous. (It is easy to check that the multiplication 
maps [m] : E(K) — E(K) are continuous for the v-adic topology.) It 
follows that the series defining u gives a bounded continuous function 
on f(K). Finally, using Tate’s telescoping series trick, 


foe) 


4u(P) ~ w(l2P) = > (RMP ee all ([2"]P) = f(P). 


n=0 n=0 


oO 
We now have all the tools needed to prove the existence of and so 
complete the proof of Theorem 1.1. As above, let 


1 
Ai(P) = 3 max{v(x(P)~'),0}, 
1 
From Lemma 1.2, f extends to a bounded continuous function (also de- 


noted f) on all of F(A). Then Proposition 1.3 gives a bounded continuous 
function 4: E(K) — R satisfying 


f(P) = 4u(P) — p((2|P) for all P € E(K). 


Define 
MP) = a(P) + n(P). 


We now verify that \ satisfies properties (i), (ii), and (iii) of Theorem 1.1. 
(i) By inspection, A, is continuous on E(A) \ {O} and is bounded on the 
complement of any v-adic neighborhood of O. Since yw is continuous and 
bounded on all of L(A), X satisfies (i). 

(ii) If P is v-adically close to O, then A1(P) equals —4v(2(P)). We com- 
pute 


tim, {x(P) x s(ar))} = Jim, {alP) + w(P) + se(«tP))} 


jim, n(P) = n(0). 


Hence A satisfies (ii). 
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(iii) Using the formulas defining and relating A, A1, uw, and f, we find 


A([2]P) = Ai ((2|P) + ([2]P) 
= Ai ([2]P) — f(P) + 4u(P) 
= 4)1(P) + v((2y + aye + a3)(P)) — ¢v(A) + 4u(P) 
= 4X(P) + v((2y + ax + a3)(P)) — Gv(A). 


This proves that 2 verifies (iii) and completes the proof of Theorem 1.1. 
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The canonical height [AEC VIII 89] 
h: E(K) — (0,00) 


is a quadratic form defined in terms of the arithmetic of E(K). We now 
show that 2 can be decomposed as a sum of local height functions. 


Theorem 2.1. Let K be a number field, Mx the standard set of ab- 
solute values on K, and ny = [Ky : Q,] the local degree of v € Mx. 
(See [AEC VIII §5] for a description of Myx.) Let E/K be an elliptic 
curve, and for each v € Mx, let \, : E(K,) \ {O} — R be the local Néron 
height function associated to v as described in (1.1). Then 


; 1 
A(P) = Ko) 2a NyAy(P) for all P € E(K)  {O}. 


In order to prove Theorem 2.1, we will use the defining properties of 
(especially Theorem 1.1(iii)), together with the following fact, which we 
will prove later. 


Lemma 2.2. There is a finite set of absolute values S C Mx so that for 
allu ¢ S, 


AAP LS 5 max{v(0(P)"),0} for all P € E(Ky) x {O}. 


PROOF. This lemma says that for almost all absolute values, the naive 
local height 5 max{ v(x~),0} actually satisfies the quadratic property (iii) 
of Theorem 1.1. We will postpone the proof of (2.2) until §4, where we 
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will prove the more precise result (4.1) that A, = 3 max{v(x~!),0} for all 
finite places v such that the given Weierstrass equation has good reduction. 
oO 


PROOF (of Theorem 2.1). Let S be the set described in (2.2). Define a 
function 


L: E(K)\{O} —R, UP) = Ere Se ee): 


vEMK 
For any given P € E(K) x {O}, (2.2) implies that 
A(P)=0  ifvug Sand v(2(P)) > 0. 


Hence the sum }> nyA,y(P) has only finitely many non-zero terms, so L(P) 
is well-defined. 

Next we compare L(P) with h(a(P)). From Theorem 1.1(i),(ii), for 
each v € My there is a constant c, so that 


—Cy < Ay(P) - 5 max{v(x(P)-),0} < cy for all P € E(K,) ~ {O}, 


and (2.2) allows us to take c, = 0 for all v ¢ S. Now multiply by n,, sum 
over v € Mx, and divide by [K : Q]. This gives 


-c<L(P)- sh(2(P)) <e for all P € E(K) ~ {O}, 


where 


1 
C= [K:Q] > NyCy 


ves 


is finite and independent of P. In other words, if we set L(O) = 0, then 
1 
KP\= 5(2(P)) +O(1) for all Pe E(K). 


Finally, we verify the quadratic nature of L. Let P € E(k) be a point 
with [2]P #4 O. Then 
1 


L([2|P) = IK: Q Yo rvrv ([2]P) 
2 veEMK 


aa a no{ Ado (P) + u((2y + a1z + a3)(P)) + o(A)} 
= by Theorem 1.1(iii) 


papas es S~ n,A.(P) product formula [AEC VIIL5.3] 


[A : Q) veMK 
= 4L(P). 
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Since we have defined L(O) = 0, the relation L([2)/P) = 4LZ(P) holds 
for P = O, too. We must also verify it for P € E|2], PAO. The quickest 
way to do this is to use the triplication formula for \ (exercise 6.4e), 


dv ([3]Q) = Mv(Q) + v((3x* + box? + 3b42? + 3bgx + bg)(Q)) — “¥(A) 
for all Q € E(K,) with [3]Q 4 O. 


Summing over v € Mx as above gives 
L([3]Q) = 9L(Q) for all Q € E(K) with [3]Q 4 O. 
In particular, if P # O and [2]P =O, then 
L(P) = L([3]P) =9L(P), so L(P) =0. 


Hence 
L((2|P) = [L(O) =0= L(P) =4L(P). 


We have now proven the two relations 
L(P) = 5h(a(P)) +O(1) and L((2|P)=4L(P) for all P € E(K). 


The canonical height A also satisfies these relations [AEC VIII.9.3]. It 
follows that the difference F = L — h is bounded and satisfies F((2|P) = 
4F(P). Hence 


F(P) = pF (2 IP) = so F(P)=0 bral Pe B(K). 


N— oo 


Therefore L = h. oO 


§3. Archimedean Absolute Values — Explicit Formulas 


Let K be a field which is complete with respect to an archimedean absolute 
value | - |,, and let E/K be an elliptic curve. Then K is isomorphic to 
either R or C, and | - |, corresponds to some power of the usual absolute 
value. Thus in order to compute the local height function 4 over E(K), it 
suffices to consider the case that K = C, so in this section we will derive 
explicit formulas for the local height for elliptic curves over the complex 
numbers. 
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Recall that an elliptic curve E/C has an analytic parametrization 
C/A— E(C), z+ > (@(z; A), (2; A)), 
with Weierstrass equation 
E: (9')? = 49° — ga(A)p — ga(A) 
having discriminant 
A(A) = go(A)? — 27ga(A)?. 


(See [AEC VI.3.6] and (I.4.4).) We put this in standard Weierstrass form 
by the substitution 


1 
z= (2), y= 502), 
yielding the equation 


1 
E:y=2- qo2(A)z — 793(A) with discriminant A = A(A). 


Recall also the Weierstrass o-function (1.5.4) 


o(z) =o0(z;A) = II (a - =) a 3 (2), 


wed 


which has a simple zero at each lattice point and satisfies the transformation 
formula 


o(zt+w) =b(we™242)o(z) forall z EC, wEA. 


Here y» : A — {+1} is the map with ~(w) = 1 if and only if w € 2A, 
and 7: A > C is the quasi-period homomorphism. 
We have seen (1.5.6b) that there is a factorization 


g(a) = -T. 


Applying log| - | yields 
log|o(2z)| = 4 log|o(z)| + log|9’(z)|. 


Since g'(z) = 2y, comparison of this equation with Theorem 1.1(iii) (and 
the fact that the local height has a pole at O) suggests that the local height 
function on E(C) = C/A should look like —log|o(z)|. Unfortunately, the 
transformation formula for o(z) shows that |o(z)| is not invariant under 
translation by A, so —log|o(z)| is not well-defined on C/A. The next 
proposition explains how to modify o(z) to obtain an invariant function. 
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Proposition 3.1. Let A c C be a lattice. Extend the quasi-period 
map 7 : A — C linearly to obtain an R-linear homomorphism (also de- 
noted 7) 

7: C+A8,R—C. 


(a) For all z,w € C, the quantity 
zn(w) — wn(z) 
is purely imaginary. 
(b) Define a function 
F(z) = e722") q(z), 
Then 
F(z +w) = p(w)e? 21) #9) F(z) for allz EC, w EA. 


(Note that F(z) is not holomorphic, because n(z) is only R-linear.) 
(c) The function |F (z)| is a well-defined function on C/A and is real- 
analytic and non-vanishing away from 0. 


PROOF. (a) Choose a basis w,,w2 for A with Im(w,/w2) > 0. Legendre’s 
relation (1.5.2d) says that 


W1N(W2) — wen(w,) = 27. 
Write 
zZ=awu,+bue, w=cw, + dug, with a,b,c,d€ R. 
Then the R-linearity of 7 and Legendre’s relation give 


zn(w) — wn(z) = (ad — be) (win(we) — wen(w1)) 
= (ad — bce)2ni. 


(b) Using the transformation formula (1.5.4) for o(z) stated above, we 
compute 


F(z +w) =e 22+#)124+) o(z + w) 


= en 22TH)NZ+H) ah (y eM) 242) g(z) 
= Y(w)e2 en) -onl2)) F(z), 
(c) Since ~(w) = +1, and since (a) implies that 


ed (2n(w)—wn(2))| 1 
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it follows from (b) that |F(z)| is well-defined on C/A. Further, F(z) is 
clearly real-analytic on C and vanishes only at points of A, so |F(z)| is 


real-analytic and non-vanishing on C\A. 


Theorem 3.2. Let E/C be an elliptic curve with period lattice A. Then 
the Néron local height function 


A: E(C)\ {0} —R 
is given by the formula 


A(z) = — log Je“ Fe o(z)A(A)# 


a 5 Re(2n(2)) —log|o(z)| — a log|A(A)|, 


where 7: C — C is the extension of the quasi-period map described above 
in (3.1). 


PROOF. Let A(z) be the indicated function. We must verify that \ satisfies 

properties (i), (ii), and (iii) of Theorem 1.1. First, (3.1c) ensures that A 

is well-defined on E(C) x {O}. Further, since o(z) is holomorphic on C 

and non-vanishing on C ~ A, it is clear that A(z) is actually a real-analytic 

function on E(C) \ {O}. Hence it satisfies property (i) of Theorem 1.1. 
Next we observe that the limit 


lim Nz) + 50(0(2)) 


= lim, {5 Re(en(2)) ~ 5 loslo2)*o2)| ~ 75 lox|A(4)| | 


exists, since o(z) has a simple zero and g(z) has a double pole at z = 0. 
(In fact, the limit equals 7 log|A(A)|.) This verifies property (ii). 
Finally, we must check property (iii). From (1.5.6b) we have 
log|o(2z)| = 4log|o(z)| + log|~’(z)|, 
and the linearity of n(z) gives 
Re(2zn(2z)) = 4Re(zn(z)). 


Subtracting the first equation from half the second, and then subtract- 
ing ty log|A(A)| from both sides, we obtain the desired relation 


\(2z) = 4A(z) — log|"(z)| + qloslA(A)]. 
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(Note that g/(z) = 2y and A(A) = A for the Weierstrass equation y? = 
a — 4920 — 493.) 
This proves that \ has properties (i), (ii),and (iii) of Theorem 1.1, so » 
is the Néron local height function on E(C). oO 
Remark 3.2.1. In proving Theorem 3.2, we verified directly that the func- 
tion ‘ p 
— log |e~ 27") o(z)A(A)® 


has properties (i), (ii), and (iii). This gives an alternative proof of existence 
for Theorem 1.1 in the case of archimedean absolute values. 


Corollary 3.3. The local height function 
A: E(C)\ {O} —R 
satisfies the quasi-parallelogram law 
1 
AP +Q)+A(P — Q) = 2A(P) + 2A(Q) + v(z(P) — z(Q)) _ gt) 
for all P,Q € E(C) with P,Q, P+Q ZO. 


(Note that the quantity (a(P) — x(Q))®/A is well-defined, independent of 
the choice of a particular Weierstrass model for E.) 


Proor. The Weierstrass ¢-function has the factorization (I.5.6a) 


é; _ _o(z+w)o(z—w) er 
(2) — p(w) = aeerreny eT ry ad for all z,w € C. 


Hence 
—log|o(z+w)|—log|o(z—w)| = —2 loglo(z)|—2 log|o(w) |—log|p(z)—e(w)]. 
Next, the linearity of 7(z) immediately implies 

(z+ w)n(z + w) + (2 — w)n(z — w) = 22n(z) + 2wy(w). 


Apply 4 Re( -) to this last equation and add it to the previous one. Com- 
parison with the formula (3.2) for A(z) yields 


1 
A(z +w) + A(z — w) = 2A(z) + 2A(w) — log|e(z) — e(w)| + rs log|A(A)|, 
which is exactly the desired identity. oO 


It is often convenient to use the Fourier expansions for o(z) and A(r) 
to rewrite the formula for the local height A(z). 
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Theorem 3.4. Let E/C be an elliptic curve with lattice Zt + Z normal- 
ized so that Im(r) > 0. As usual, let 


20iz SOE 


u=e and q=e 


and identify 
E(C) & C/(Zr+Z) = C*/q@ 


z arc U. 


(See (I §6) and (V §1).) Then the local height function 
A: E(C)\ {O} —R 
is given by the formula 
Mz) = -5Be (= ) log |q| — log |1 — ul — S> log|(1 — g”u)(1 — gu), 
n>1 


where i 
Bo(T) =T? ~T+= 
is the second Bernoulli polynomial. 
Remark 3.4.1. The formula (3.4) for the local height A(z) is sometimes 
rewritten using the equivalent quantities 
Imz _ log|u| _ v(u) 
Imr  log|g|_—-u(q) 


Proor. The Weierstrass o-function has the product expansion (1.6.4) 


a(z) = edits? may _ y TI (1 _ q mu) ( (1 ~~ q mart): 


Qi ors (f=)? 


The modular discriminant function has the product expansion (1.8.1) 


A(r) = (2n)!q TJ - 9"). 


n>1 
Hence 
e7 2212) g(z)A(r) % 


= 2 2(1(1)2—m(2)— 2m) a5 (1 —u) [1a Sigal qru-') 
n>1 
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To simplify the exponential, we use Legendre’s relation (1.5.2d), which 
in the case of a normalized lattice Zr + Z says 


TH(1) — n(7) = 2ri. 


Writing 
z=aT+b with a,bDER, 


we find 


n(l)z — n(z) — 2ni = a(rn(1) — n(r)) — 202 = 2mi(a — 1). 


Hence 
e22(m(L)2—n(2) 274) ois ~ e2 (at +b)-2ri(a—1) : eis 27iT 
= ef at+¢@)nit . ef ane 
so 
e22((1)2—m(2) 2m) @ zy en qz(a ats) ; 


Substituting this in above yields the formula 


2(a?-a+2 n Nay 
q?* FH (1 —u) [[-aru)(t- qu"). 
n>1 


le“ o(z)A(r) 


Theorem 3.2 says that applying —log(-) to the left-hand side gives the 
local height A(z). Since Im(z) = Im(ar + b) = aIm(r), this completes the 
proof of Theorem 3.4. Oo 
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Let K be a field with absolute value v, and let E'/K be an elliptic curve. If 
the absolute value v on K is non-archimedean, then we can talk about the 
reduction of F modulo v. More precisely, fix a Weierstrass equation for E 
with v-integral coefficients (i.e. v(a;) > 0). We consider the reduction E of 
that Weierstrass equation modulo the maximal ideal of the local ring R = 
{a € K : v(a) > 0}. The reduced curve E may be singular. Define a 
subset Eo(K’) of E(K) by 


Eo(K) = {P € E(K) : P is a smooth point of E}. 


In particular, if the reduced equation is smooth (E£ has good reduction), 
then Eo(K) = E(K). (See [AEC VII], which discusses in some detail the 
case that v is a discrete valuation.) 

We now show that for points in Fo(K) the local height is given by a 
simple formula. 
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Theorem 4.1. Let K be a field complete with respect to a non-ar- 
chimedean absolute value v, let E/K be an elliptic curve, and choose a 
Weierstrass equation for E with v-integral coefficients, 


E:y?+acy +a3y = x? + aor? + age +6. 


Let A be the discriminant of this equation. Then the Néron local height 
function \ : E(K)~\ {O} — R is given by the formula 


A(P) = 5 max{v(2(P)~"), 0} + 4) for all P € Eo(K). 


Remark 4.1.1. If & has good reduction, then we can find a Weierstrass 
equation for E/K with Eo(K) = E(k) and v(A) = 0. In this situation, 
the proof of (4.1) will show that the function 


5 max{v(x(P)-'),0} 


has properties (i), (ii), and (iii) of Theorem 1.1. This provides an alterna- 
tive proof of the existence of 4 in this case. Further, since A is invariant 
under finite extension of the field K (1.1c) and is independent of the choice 
of Weierstrass equation (1.1b), we actually obtain an existence proof when- 
ever £ has potential good reduction. 


Remark 4.1.2. The local height » is independent of the choice of Weier- 
strass equation. But the formula for \ given in Theorem 4.1 does not 
appear to be independent of this choice. For example, the change of co- 
ordinates x = u~22’, y = u~3y/ will alter the formula in Theorem 4.1 
to 


1 1 
5 max{v(2’(P)~'),0} + 4’) 
1 ~2 -1 L712 
or max{u(u-7a(P)~"),0} + uw w) 
However, this new formula for \(P) is valid only for points in Eg(K), 
where Ej(K) is defined using the equation with coordinates (z’,y’). One 


can verify that the two formulas agree on the intersection Eo(K)M E6(4), 
as they should. 


ProoF (of Theorem 4.1). Let 
1 
Ay(P) = 5 max{o(2(P)-1),0} + p04). 


By inspection, A; satisfies conditions (i) and (ii) of Theorem 1.1. We next 
verify condition (iii) for points in Eo(K) ~ {O}. 
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As usual, let 


d(x) aa a a bax? — 2bex = bg, 
Wz) = Ax + box? + 2bga + bg = (2y + aya + a3)? 


be the functions appearing in the duplication formula 


0) 


Then the equation 
1 
Ay ((2]P) = 41 (P) + v((2y + a,xr+ a3)(P)) a qv) 
to be verified is equivalent (after some algebra) to 


min{v(o(P)),v(~(P)) } = min{4v(a(P)), 0}. 


First suppose that v(a(P)) < 0. Then the (non-archimedean) triangle 
inequality yields 


v(d(P)) = 4v(a(P)) and = v((P)) > v(42(P)*) > 4v(x(P)), 
so the desired relation is true. We are left to prove 
PeéE(K) and v(2(P)) > 0 => min{v(d(P)),v(v(P))} =0. 


To prove this, we must express the condition P € Eo(K) in terms 
of @(P) and w(P). Let 


F(X,Y) =Y? + a, XY + a3Y — X94 — agX? -— aX — ag 


be the polynomial defining EZ. Recall that a point (xo, yo) € E is singular 
if and only if 
Fx (0, yo) = Fy(zo, yo) = 0. 


(See [AEC 1.1.5]. The subscripts denote partial derivatives.) Now Eo(K) 
consists of all points whose reduction modulo v is non-singular on the 
reduced curve £. So we see that 
P € Eo(K) => v(Fx(P)) <0 or v(Fy(P)) <0. 
We also recall the addition formula [AEC III.2.3(c)], which says that 


#([2|P) =m? + am — a2 — 22(P), 
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where m is the slope of the tangent line to FE at P. (Note that m # oo 
since [2]}P £0.) Thus 
_ Fx(P) 
Fy(P)’ 


and so we find eee ere 


for the polynomial 
G = a,Fy — (ag + 2X)Fy € R[X,Y| C K(E). 


(N.B. G has coefficients in the valuation ring R, since by assumption the 
coefficients of F' are v-integral.) Thus 


d=FR+GFy and yw=F2. 
Now let P € Eo(K) satisfy v(a(P)) > 0. Then 
0 < min{v(d(P)),v(v(P))} since v(z(P)) > 0 


= min{v(Fx(P)? + G(P)Fy(P)),v(Fy(P)?)} 


=0 since either v(Fx(P)) <0 or v(Fy(P)) < 0. 
This completes the proof that 


Ai((21P) = 41(P) + 0((2y + are + a5)(P)) ~ Go(A) 
for all P € Eg(K) \ {O}. 


We have now shown that A; satisfies conditions (i), (ii), and (iii) of 
Theorem 1.1 for all points in Eo(K) \ {O}. If Eo(K) = E(K), that is, if E 
has good reduction and we take a minimal Weierstrass equation for FE, then 
the uniqueness assertion of Theorem 1.1 implies that 4 = A. However, 
even if the Weierstrass equation for F has singular reduction, the proof of 
uniqueness in Theorem 1.1 works for the subgroup Eo(K). A brief sketch 
follows. 

From (i) and (ii), the difference A = » — 41 extends to a bounded, 
continuous function on all of Eo(A’) (in fact, on all of E(K)). Further, 
from (iii) it satisfies A([2]}P) = 4A(P) for all P € Eo(K). (By continuity, 
this holds even when [2|P = 0.) Then 

A(P) = pA(2%)P) rome so A(P) = 0 for all Pe Eo) 

It remains to find an explicit formula for the local height in the case 
that £ has bad reduction at v and P is not in Eo(K). Since the local 
height is invariant under finite extension of K (1.1c), it suffices in principle 
to consider the case that E has split multiplicative reduction at v. For 
computational purposes, however, it is often more convenient to work di- 
rectly over K. Explicit formulas for A in the case of additive reduction are 
given in exercises 6.7 and 6.8. 
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Theorem 4.2. Let K be a p-adic field (i.e., a finite extension of Q,) with 
absolute value v = —log| - |y, let ¢ € K* satisfy |q|, < 1, and let E,/K be 
the Tate curve (V.3.4) with its parametrization 


Q: Kg => E,(K). 


(a) The Néron local height function 
Aog: Ey(K)\{O} —R 
is given by the formula 


A(o(u)) = 5B: (3) v(q) +u(1 —u) + ab v((1 — g”u)(1 — q"u7*)). 


(b) If we choose u (by periodicity) to satisfy 
0 < v(u) < vq), 


then 


ae Ga v(q),  if0 < v(u) < v(q), 


A(o(u)) = : 
v(l-u)+—v(q), if v(u) =0. 
(Note that for a Tate curve, v(q) = —v(j(Eq)) = v(A(@).) 


Proor. (a) The Tate parametrization (V.3.4) is a v-adic analytic map 
from K*/q” to the elliptic curve with Weierstrass equation 


Eg: yr t+ary= x2 t+ayg(q)et+ a¢6(q) 
defined by 
b:K*/¢ > E(K), —o(u) = (X(u), Y(u)). 


(For the series defining a4(q), a6(q), X(u), and Y(u), see (V.3.1).) The 
discriminant of the Weierstrass equation for E, has the product expan- 
sion (V.3.1b) 


A(q) =a] [a-4")™. 


n>1 


Recall also the v-adic 6-function (V.3.2) 


(u) = (1 —w) T] P= Pw aru) 


— qn)2 
ee (Eg) 
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and the factorization (V.3.2b) 


uO(u?) 


2Y (u) + X(u) = — Aut 


Applying v to this relation and doing a little algebra, we find that 
1 1 
{v(o(u2) = sou) =4 {»(0(u)) ~ sx} + v(2Y(u) + X(u)). 


This suggests that v(@(u)) — 4v(u) — 7v(A) would be a good candidate 
for A, since it has property (iti), but oo. it is not invariant under 
the transformation u +> qu. So we make a slight alteration to obtain an 
invariant function. 

Define is (u) 

v(u 
Mu) = = 0 
(u) = 5 Ba (2) wa) + 0(0(u)) 

We will show that is the Néron local height function by verifying that it 
satisfies properties (i), (ii), and (iii) of Theorem 1.1. 

First, using the identities 


ge) ae B(T +1) = BT) +27, (qu) = ~ 00) 


v(q) vq) 


(the last is (V.3.2a)), it is easy to check that A(qu) = A(u); so A is well- 


defined on 
Eq(K) \ {O} = (K*/q*) < {1}. 


Next, the product defining @ is absolutely convergent and non-zero 
away from q“. Hence is continuous (in fact, v-adically analytic) on 
E,(K) ~ {O} and bounded on the complement of any neighborhood of O. 
This verifies (i). 

To check (ii) we compute 


lim {aw + 5(x(w)} = 5 Bo(0)v(a) + 5 lim v(0(u)?X (u)) 
= 5 Ba(0)v(a) + 5 lim, v((l =u)? X(u)). 


The series for X(u) given in (V.3.1) is 


-1 


qhu 
at ees aa + ap} 2a, 
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so we see that the pole at u = 1 comes only from the n = 0 term. Hence 
lim (1 — u)?X(u) = 1, 

which proves that the above limit exists. (In fact, the limit is j,v(q).) 


Finally, we verify the duplication formula (iii). Note first that for 
all n > 1 we have v(1 — gq”) = 0, so 


v(A(q)) = v(4 II‘ (1-q” Y\= = v(q). 
Next we add the formula 


(»(o0u*) = sv?) a (o(eu) 7 5v(u)) 4 v(2¥ (u) + X(u)) 


obtained above to the identity 


(3 Es a | v(q) =4 (5 (oY. ts 5) v(q) — 70(a)- 


Since v(q) = v(A(q)), this gives property (iii): 


A(u2) = 4A(u) + v(2¥ (a) + X(u)) — 7v(A(a)). 


We have now shown that A(w) satisfies properties (i), (ii), and (iii) of 
Theorem 1.1, so it is the Néron local height function. 
(b) Since 0 < u(u) < v(q), we have 


v(1 — q™u) = v(1—g"u7') =0 for all n > 1. 
So the formula in (a) becomes 


v(u) 
() 
If in addition v(u) > 0, then v(1 — u) = 0, which gives the first expres- 


sion. Similarly, if v(u) = 0, then the second expression is a consequence 
of B2(0) = 1/6. o 


A(u) = 55, (44 ) va) + o(t = w). 


Remark 4.2.1. The proof of (4.2) shows directly that the function de- 
scribed in (4.2a) satisfies conditions (i), (ii), and (iii) of Theorem 1.1, so 
we obtain an independent proof of the existence of the Néron local height 
for Tate curves. We have now given proofs of the existence of the Néron 
local height, independent of the proof in §1, in the following three cases: 
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(A) K =C, see (3.2.1); 
(B) K/Q, and E/K has good reduction, see (4.1.1); 
(C) K/Q, and E = E, is a Tate curve, see (4.2). 


But if K is the completion of a number field with respect to some absolute 
value, and E/K is an elliptic curve, then we can find a finite extension L/K 
so that E/L falls into one of the three cases (A), (B), or (C). This follows 
from [AEC, VII.5.5] and (V.5.3). Now using the elementary fact (1.1c) that 
the local height is invariant under field extension, we obtain an independent 
proof of the existence of \ for all E/K. 


EXERCISES 


6.1. Let K be a field, | - |, an absolute value on K, and E/K an elliptic curve. 
For any rational function f € K(£), let 


U;s(K) = {P € E(K) : f is defined at P}. 


(a) Prove that the map 
Udy -3R, Pee lf), 


is continuous. (Here U;() inherits the v-adic topology from E(K), and R 
is given the usual topology.) 

(b) Prove that the topology on E(K) described in §1 is the weakest topol- 
ogy (i.e., the topology containing the fewest open sets) such that the maps 
in (a) are continuous for every rational function f € K(£). 


6.2. Let K be a field, | - |, an absolute value on AK, and E/K an elliptic curve. 
If K is locally compact, prove that E(A) is compact. In particular, E(k) 
is compact if K =R, K =C, or K is a finite extension of Qp. 


6.3. Let K be the completion of a number field with respect to some abso- 
lute value, and let E/K be an elliptic curve. Prove that for all P,Q € 
E(K) with P,Q,P +Q # O, the Néron local height » satisfies the quasi- 
parallelogram law 


MP + Q) + A(P — Q) = 2A(P) + 2X(Q) + v(2(P) — 2(Q)) — Zv(A). 


(Hint. We already proved this for K = C in Corollary 3.3. Going to an 
extension field, it suffices to prove the result when K is a finite extension 
of Q, and E has either good or split multiplicative reduction. For the 
former, the formulas for z3 + x4 and x34 in the proof of [AEC, VIII.6.2] 
may prove useful, at least if p # 2,3; whereas for the latter, you can use 
the Tate curve together with (V.3.2b) and (4.2) to mimic the proof of 
Corollary 3.3.) 
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6.4. 


6.5. 


Let K and E£ be as in the previous exercise, and fix a Weierstrass equation 
EB: y° tairyt+azsy = e+ aor” + asx t+ ag. 
For each integer m define a function 


Fyn(x) = m? Il (x — x(T)) € K(E). 


TEE|[m} 
T#O 


(a) Prove that 
an(hy) = 2( - (r)) ~ 2m(O). 
TEE[m] 
(b) Prove that 
Fy = Ag? + box? + 2b4x + b6 = (Qyt+aiat+ a3)°, 
F3 = (327 + box? + 3bax? + 3bex + bs)’, 
Fy = Fo(2a° + box? + 5b4x* + 10b6x°* 
+ 10bgx” + (bzbg — babe) x + babg — bg)”. 
Generally, show that there exist functions jm € K(x,y) = K(£) satisfying 
Fy, = U2,. (Wm is the m'?-division polynomial; see [AEC, exercise 3.7].) 
(c) Let A be the discriminant of the given equation. Prove that the func- 
tion : 
Fyn(x)® 
Am? 1 
is independent of the Weierstrass equation. 
(d) Prove the recurrence formula 


Fn4iFm—1 = (20 [m] —2)F?, for all m > 2. 


By convention, we set F\(x) = 1. (Hint. Compare divisors. Then to find 
the constant, let P — O.) 
(e) Prove that 


Alm] P) = mP XP) + 50(Fn(P)) - ™*0(d), 


for all P € E(K) with [m]P #4 O. 


Let E/C be an elliptic curve with normalized lattice Zt + Z, and let A(z) = 
A(x + ty) be the local height function on E(C) \ {O}. 


(a) Prove that 
| A(z) dx dy = 0. 
E(C) 


(Note that this is an improper integral, since A(z) blows up at z = 0. Be 
sure to check that the integral converges.) 
(b) Prove that A(z) is a solution of the differential equation 


oP om Qa 
(Fs 2 a) Me Taser? 
(This exercise, together with Theorem 1.1(ii), says that A(z) is the Green’s 
function on E(C) for the divisor (O).) 
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6.6. 


6.7. 
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Let K be a field with absolute value v and let E/K be an elliptic curve. 
Fix a Weierstrass equation for E, 


E:y=c2+Ar+B, 
with discriminant and j-invariant 


A =—16(44°7+27B?) and = j = —(48A)°/A. 


(a) If v is non-archimedean, and A and B are v-integral, prove that for 
all P€ E(K,) \ {O}, 


~ 1 max{v(j(E)~*), 0} < A(P) — ; max{v(2(P)~*),0} < (A). 


(Hint. Use the explicit formulas (4.1) and (4.2).) 
(b) Ifv is archimedean, prove that for all P € E(.K.) \ {O}, 


1/6 
Na - 5 max {" (5) oh) < 5 max{o(j(E)~"),0} + v(3). 


(c) Now suppose that & is a number field, and A and B are in the ring of 
integers of K. Prove that for all P € E(K), 


iP) = sh(o(P))| < sh(3) + aha) + log 3. 


(The constants in (b) and (c) are certainly not best possible. See if you 
can improve them.) 


The next three exercises give formulas for the local height which are es- 
pecially well suited for numerical computations. We begin with the non- 
archimedean case. 

Let K be a field complete with respect to a discrete valuation v, 
let E/K be an elliptic curve, and fix a minimal Weierstrass equation for E, 


y? +airy + asy = 2° + agx” + ase + ae. 


Let A be the discriminant of this equation, and let P € E(K). 
(a) Prove that P € Eo(K) if and only if either 


v((3x? + aga + a4 —ary)(P)) <0, or v((2y+aiz+a3)(P)) <0. 
Note that if P € Eo(K), then (4.1) says that 

MP) = 5 max{u(2(P)~1), 0} + aud). 
(b) Assume that v(A) > 0 and v(c4) = 0. (This means that E has multi- 
plicative reduction at v; see [AEC VII.5.1b].) Let 


__. fu((2y+aiz+as)(P)) 1 
a(P) = min | eu teaere®) st. 


Prove that if P ¢ Eo(K), then the local height of P is given by the formula 


ACP = 5 Ba(a(P))o(A). 
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6.8. 


6.9. 


*Let K, v, E and A be as in the previous exercise. Suppose that EF has 
additive reduction (i.e., v(A) > 0 and v(ca) > 0.) Let P € E(K) with 
P ¢ Eo(K). Let F2 and F3 be the polynomials defined in exercise 6.4b. 
Prove that 


ae —50(FA(P)) + 504) if v(Fs(P)) > 3v(F2(P)), 
~450(Fs(P)) + 4) otherwise. 


Let E/R be an elliptic curve given by the usual Weierstrass equation 
2 3 2 
y +ayry + azy = © + ox" + asx + ae, 
and suppose that «(P) # 0 for all P € E(R). (Note that one can always 


achieve this condition by making a shift x = x’ + r for sufficiently large r.) 
Then the functions 


1 
aes w = 4t + bot? + Qbat? + dbet*, 2 = 1 — bat? — 2het? — bet’, 


are well-defined for all points in E(R). 
(a) Prove that 


This gives a convenient recursive formula for computing z([2”]P). 
(b) Prove that there are constants c1,c2 > 0, depending on the Weierstrass 
equation, so that 


ci < 2(P) < ce for all P € E(R). 


Conclude that the series 
co 1 h 
> Fe loe|2((2"1P) 
n=0 


is absolutely convergent for all P € F(R). 
(c) Prove that the local height function on E(R) \ {O} is given by the 
formula 


1 1 lvl ig 
A(P) = 5 logle(P)| — zy log Al + 5) ga loele((2"1P)1. 
n=0 


(d) Give a modification of the series in (c) which converges to give the 
local height function on E(C) \ {O}. (Note that in this case there will 
always be points with 2(P) = 0.) 
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6.10. Using the previous three exercises, compute the canonical height A(P) of 


the indicated point on each of the following curves. 


(a) y +y=2° 2, P = (0,0), 
(b) yty=2—2’, P = (0,0), 
(c) y? +acyty = 2° — 2? — 48x + 147, P = (13,33), 


(d) y t+ayty=a2% +27 —1001lz+12375, P= (45,224). 


. Define the periodic second Bernoulli polynomial Ba(t) by 


1 

Ba(t) = (¢— [))"-(t- ll) +5 

(ie., Bz equals Bz on the interval 0 < t < 1, and is extended periodically 
modulo 1 to all of R). 


(a) Prove that B2(t) has the Fourier expansion 
1 Qrikt 


e€ 
B2(t) = 55 a a 


kEZ, k£0 
(b) Let t1,...,tn~ € R. Prove that 


N 
) Ba(ti —t;) > aor 
1St,joN 
tAG 


(c) Let K be a complete field with discrete valuation v, let E/K be an 
elliptic curve, and let A be the discriminant of a minimal Weierstrass equa- 


tion for F at v. Prove that for any collection of distinct points Pi,..., Px € 
E(k), 
N 
dS AP - Pi) > - rl). 
1<i,j<N 
ixj 


( Hint. Since 2 is invariant under finite extension of K, it suffices to consider 
the two cases of good and split multiplicative reduction.) 

(d) Let K be complete with respect to a discrete valuation v, and sup- 
pose that E& has split multiplicative reduction. Choose a parametriza- 
tion E(K) = K*/q", and let P € E[N + 1] correspond to q'/(%t+), (Take 
any root in K.) For each i, 1 <i< N, let P; = [i)P. Prove that 


S- ABP) = — Ful) (1 = <=) 


1<i,j<N 
tfj 
Thus the estimate in (c) is essentially best possible. 
(e) *Let £/C be an elliptic curve. Prove that there is a constant c = c(E) 
so that for any set of distinct points Pi,..., PN € E(C), 


SS) MPP) > —5Nlog N — (E)N. 


1<t,j5N 
t#5 


(This is the archimedean analogue of (c). It is quite difficult.) 


APPENDIX A 


Some Useful Tables 


§1. Bernoulli Numbers and ¢(2k) 


1 
Values of the Riemann ¢-Function at Even Integers, ¢(s) = a 
n>1 m 
1? 14 
2) = ae 4) = ae 
7 78 
6 — a = ee re 
¢(6) 33.5-7 ¢(8) 2.33.52-7 
qi 691712 
10) = a [DS 
¢(10) 35-5-7-11 ¢(12) 36 .53.72.11-13 
Qni4 
14S ea 
oy) 36 .52-7-11-13 
x St 
Bernoulli Numbers a = 2 Be 
1 1 1 
Bs = = = seabed See me 
6 Bs 30 Be 42 
1 5 691 
Bo = ne: ee 2 os piesa 
30 fe 6B 2730 
7 3617 43867 
Biz = = Big = —— = oAat 
- 6 Bio «= 798 
283 - 617 11-131 -593 103 - 2294797 
Bay = 5 — Boo ae a rire ae I EE 
oe 330 a8 138 Bos 2730 
13 - 657931 7 - 9349 - 362903 
Bog = ——— Boa = -——— 


6 870 


482 A. Some Useful Tables 


§2. Fourier Coefficients of A(7) and j(7) 


Fourier Coefficients of (2m)! A(r) = > T(n)q" =q Il se ae 


n>0 n>1 
T(1)= 1 7(2)= —24 7(3) = 252 
r(4)=  —1472 (5) = 4830 r(6)=  —6048 
T(7)=  -16744 7(8) = 84480 7(9) = —113643 


7(10) = —115920 r(11) = 534612 = r(12) = —370944 


The function 7(n) is called the Ramanujan 7 function. For values of T(n) 
with n < 300, see Lehmer [1]. 


1 
Fourier Coefficients of j(7) = — + ~ c(n)q”. 
q n>0 


c(0) = 744. (1) = 196884 
c(2) = 21493760 = c(3) = 864299970 
e(4) = 20245856256 —c(5) = 333202640600 
c(6) = 4252023300096 —¢(7) = 44656994071935 
e(8) = 401490886656000 


1 
Inversion of Series for j Function, q = > d(n)—. 
n>1 


d(1) = 1 d(2)= 744 
d(3) = 750420 d(4) = 872769632 
d(5) = 1102652742882 (6) = 1470561136292880 
d(7) = 2037518752496883080 (8) = 2904264865530359889600 
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§3. Elliptic Curves over Q with Complex Multiplication 


In this section we describe all elliptic curves defined over Q with complex 
multiplication by an order R = Z+ fRx of conductor f in a quadratic 
imaginary field K = Q (/-D) of discriminant —D. The first table gives 
the j-invariant for each such order. The second table gives a representative 
elliptic curve E over Q with the specified 7, together with the minimal 
discriminant Ag and conductor Ng of E. Those curves possessing endo- 
morphisms of degree 2 are discussed in (II.2.3.1). 


Discriminant Conductor | j-invariant 
—-Dofk sill: fofR of E 
—3 1 0 
2 243353 
[| 3 2153 . 53 
—4 1 2032 
2 223d? 
—7 1 —3353 
2 3353173 
—8 ir 1 2858 
11 1 —215 
—19 1 =91938 
—43 1 — 2183353 
—67 1 — 2153353118 
-163 fk 1 — 2183353933298 
Minimal Weierstrass 
[_ ee aiff a equation of FE over Q Az eee 
—3 1 yty=x? 3° 33 
2 y? = 2° — 15x + 22 2835 2738 
3 yety=z° — 3024+ 63 3° a 
—4 1 y=arte 2° oe 
2 y? =a? —-llea+14 2° 22 
—7 1 | y+ey=ae2—-2?-2r-1 2 7? 
2 | y? = x3 — 5952 + 5586 2173 | 947? 
—8 1 y? = 2° 4+ 4a? + Qe 2° 28 
-11 / 1 yity=22—-2?—-7r+10 is 117 
=i Et MeL Me ye ty =z — 3824 90 193 19? 
—43 1 y? +y = x — 860x + 9707 433 43? 
—67 1 y? +y = 2° — 7370x + 243528 67° 67? 
—163 |] 1 y? +y = x? — 21744202 + 1234136692 163° 163 


Notes on Exercises 


Many of the exercises in this book are standard results which were not included 
in the text due to lack of space, whereas others are special cases of results which 
appear in the literature. The following list thus serves two purposes. First, it 
is an attempt by the author to give credit for the theorems which appear in the 
exercises, and second, it will aid the reader who wishes to delve more deeply into 
some aspect of the theory. However, since any attempt to assign credit is bound 
to be incomplete in some respects, the author herewith tenders his apologies to 
anyone who feels that they have been slighted. 

Except for an occasional computational problem, we have not included so- 
lutions (nor even hints). Indeed, since it is hoped that this book will lead the 
student on into the realm of active mathematics, the benefits of working without 
aid clearly outweigh any advantage that might be gained by having solutions 
readily available. 


CHAPTER I 


(1.1) For an elementary proof, see Alperin [1]. 
(1.5) (e) Let h(—D) denote the class number of Q (Vv —D ). Then A(—3) = 1, 
h(—5) = 2, h(—23) = 3, h(—29) = 6, h(—47) =5. 
(1.10) See Serre [3, VII, §3.2, Cor. 2]. 
(1.11) A similar argument is given in Serre (3, VII, Thm. 3] and Apostol {1, 
Thm. 2.4]. 
(1.13) See de Shalit [1, II §2, equation (4)]. See also Weil {1, Ch. II, IV]. 
(1.14) Proven in Stark [2] using Kronecker’s limit formula. Alternatively, one 
can compare poles and zeros to see that the ratio is constant, and then 
let z — 0 to find the constant. 
(1.16) Answer: s(2,y) = (y—1)(y—5)/24 if y is odd, and s(2, y) = (y?+5)/24y 
if y is even. 
(1.19) (a) See Shimura [1, exercise 3.27]. (b,c) See Shimura [1, Thm. 3.24]. 
(1.20) (a,b) See Shimura [1, exercise 2.8}. 
(1.22) See Lang [2, Ch. III §4], Shimura [1, Ch. 3, §§4,5], or Ogg [1]. 
(1.24) See Serre [3, Ch. VII §4.3]. 
(1.25) See Apostol {[1, Thm. 6.16]. 
(1.26) This is due to Hecke. See Apostol [1, Thm. 7.20]. 
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(1.29) (c) This is due to Hecke. See Ogg [1, Ch. I, Thm. 1, p. IL-5]. 
CHAPTER II 
(2.1) Write Rx = Z+ Zr, and for any a € R, write @ = aq + baT with 


ada € Zand ba € Z. Then f = min{ba : a € R, ba > Of. 


(2.2) This is due to Hurwitz [1]. A nice exposition of the proof is given in 
the appendix to Rosen [1]. 
(2.9) (a) See Shimura [1, (5.4.3), p. 124]. (b) See Shimura [1, exercise 5.8, 
p. 124). 
(2.12) (b) This exercise was suggested by David Rohrlich. 
(2.13) The minimal polynomial of 8 is 27x® + 72x* — 16. 
(2.14) Ky = Ly = K, K3 = K, Ls = K(*/4), Ka = K(V3), and 
La = K(V—-9+ 673). Gal(La/K) is cyclic of order 4. In computing 
La, the identity —10 — 6/3 = (—1 — V3)? is useful. 
(2.17) See Lang [1, Ch. 5, §1]. 
(2.18) (a,b,c,d) See Lang [1, Ch. 5, §2], Shimura [1, Ch. 4.6]. (f) See P. Cohen 
[1] and Silverman [4]. 
(2.19) See Lang [1, Ch. 5, §3]. 
(2.20) See Lang [1, Ch. 5, §2]. 
(2.21) See Lang [1, Ch. 5, Thm. 2]. 
(2.25) See Gross [1, Lemma 9.2.5]. 
(2.30) See Lang [{1, Ch. 10, §4, Thm. 10]. For the general case of abelian 
varieties, see Shimura [1, Thm. 7.46 and Prop. 7.47]. 
(2.33) See Ireland-Rosen [1, Ch. 18, 4]. 
CHAPTER It 
(3.4) The Mordell-Weil theorem for abelian varieties over finitely generated 
fields is due to Néron. See, for example, Lang [4, Ch. 6, Thm. 1]. 
(3.11) (c) See also Chapter VI text and exercises for an approach using local 
height functions which give a better estimate in (c). One can then use 
(c) to prove (a) and (b). 
(3.12) [0,1,0], [0, £7, 1}, [7, 0,1], [1 -7,4(1 — 27), J, 
[V2,+(2T — 1), 2V2]. 
(3.14) See Mumford [1, §6, Lemma on p. 56}. 


(3.15) 


(3.16) 


(3.17) 
(3.20) 


In general, the Mordell-Weil group of an abelian variety is finitely gen- 
erated in the following two cases: (i) (Néron) if the field of definition is 
finitely generated over Q. (ii) (Lang-Néron) if one takes the quotient 
by the subgroup of points defined over the constant field. For details, 
see Lang [4, Ch. 6, theorems 1 and 2]. 

This version of (III.11.3.1) and (II1.11.4) for split elliptic surfaces is 
due to Dem’janenko [1], with a generalization to abelian varieties due 
to Manin [1]. The example in (d) is due to Dem’janenko. 

See Kuwata [1,2]. 

(a) 2. (b) 3. (c) 2. (d) 4. 
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(3.21) 
(3.23) 
(3.24) 
(3.31) 
(3.32) 


(3.34) 
(3.36) 


(3.37) 
(3.38) 


Notes on Exercises 


This is a special case of Zariski’s Main Theorem, see Hartshorne [1, 
V.5.2, T1I.11.3, exercise [I.11.4]. 

This is a special case of the general fact that algebraically equivalent 
divisors are numerically equivalent, see Hartshorne [1, exercise V.1.7]. 
(b) det(Ioo) =n. (c) ai =i(n—k)/nif1<i<k, anda; =k(n—i)/n 
ifk <ai<n. See, for example, Cox and Zucker [1, Table 1.14]. 

See Lang [4, Ch. 4, Prop. 5.2}. 

See Lang [4, Ch. 4, Prop. 3.3 and Cor. 3.4]. 

This estimate is due to Tate [4], see also Lang [4, 12 Cor. 5.4]. 

(b) This is due to Kodaira, see Shioda [3, Prop. 2.8]. It has been 
frequently rediscovered, see for example Hindry-Silverman [2, Thm. 5.1] 
and Szpiro [1]. 

This result is due to Hindry and Silverman [2]. 

For an explicit construction of the Jacobian variety of a hyperelliptic 
curve, see Mumford [3, Ch. IIa]. In particular, Pic(C)(2] is described 
in Mumford [3, Ch. Ia, Lemma 2.4 and Cor. 2.11]. 


(3.40) See Shioda [3, Prop. 1.6]. 
CHAPTER IV 

(4.5) (a) See Matsumura [1, corollary to Thm. 45 (18.G)]. 
(4.7) See Bosch-Liitkebohmert-Raynaud [1, §2.4, Prop. 8]. 

(4.10) See Shatz [1, §2] or Waterhouse [1, Ch. 1 and 2]. 

(4.15) (a) This is a restatement of Lemma IV.9.5. 

(4.16) C7: (y? — 2°)(2 — 1)? (y — 2x) = 0. 

(4.19) See Artin [1, §0]. 

(4.24) See Bosch-Liitkebohmert-Raynaud [1, Ch. 7, Prop. 6]. 


(4.25) 


(4.26) 
(4.27) 


(4.29) 


(4.30) 
(4.31) 


(4.32) 


(4.35) 
(4.36) 


(4.37) 


Combine (IV.5.3) and (IV.9.1). For a more intrinsic proof for general 
group schemes, see Artin [1, Lemma 1.16]. 

See Greenberg [1, 83, Lemma 2] for a multi-variable version. 

(b) See Bosch-Liitkebohmert-Raynaud [1, 2.2, Prop. 7]. (c) See Bosch- 
Liitkebohmert-Raynaud [1, 2.3, Prop. 5]. 

See Milne [3, §7] or Weil [3]. 

See Artin [1, Cor. 1.6]. 

(c) The special fiber has five components, three of multiplicity 1 and 
two of multiplicity 2. This is a fiber of Type [X-1 in the classification 
of Namikawa and Ueno [1]. (d) The special fiber has four components, 
one of multiplicity 1, two of multiplicity 2, and one of multiplicity 3. 
This is a fiber of Type VIII-3 in the classification of Namikawa and 
Ueno [1]. 

(c) Raynaud [1] has proven a general result which allows one to compute 
the group of components on the Néron model of the Jacobian of a curve 
in terms of the incidence matrix of the special fiber of a minimal proper 
regular model of the curve. 

This exercise is taken from Tate (2, end of §6]. 

(a,b) This is an unpublished result of Mestre; see Kraus [1]. (c) This 
is due to Kraus [1]. 

See Néron [1, §III]. 
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(4.39) The proof is essentially the same as the proof of [AEC, IV.6.1]. See 
also exercises 2.22 and 2.23. 

(4.43) (a) {1}. (b) Z/2Z. (c) Hs. 

(4.44) This follows easily from [AEC, IX.6.1]. 

(4.45) See Serre [4, Ch. IV, Section 1]. 

(4.46) (b) See Serre [4, Ch. VI, Thm. 1’]. (d) See Ogg [2], Serre-Tate [1, §3], 
and Serre [7, Ch. 19]. (e) This is due to Ogg [2]. 


CHAPTER V 


(5.2) See Serre [4, VIII §4]. 
(5.13) This is due to Serre [1, (A.1.2), pp. IV-31,2.]. 
(5.14) This approach to Tate’s theorem for elliptic curves with non-integral 7 
is due to Shatz [1]. 


CHAPTER VI 


(6.5) See Lang [6, II §5}. 

(6.6) (a) This is due to Tate; see Lang [3, II, Thm. 4.5]. (b,c) See Silverman 
[3]. 

(6.7) See Silverman [2, Lemma 5.1]. 

(6.8) See Silverman [2, Thm. 5.2]. 

(6.9) The series (c) is due to Tate, and the extension to E(C) is due to 
Silverman. For proofs with error estimates, see Silverman [2]. 

(6.10) Solutions: (a) 0.02555... (b) h(P) = 0, P is a 5-torsion point, (c) 
0.01028 ..., this point has very small height, (d) 0.01049..., this point 
has very small ratio h(P)/log(A). 

(6.11) (b) This is due to Blanksby and Montgomery; see Hindry-Silverman 
[1]. (c) Hindry-Silverman [1]. (e) This is due to Elkies, see Lang [6, VI 
§5]. One can give the explicit lower bound 


1 
—5N log N - aN max{log |j(£)|, 0} — 3.64N, 


where the 3.64 is not best possible. 
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Bertini’s theorem, 233 
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Birch, B.J., 389 
Birch-Swinnerton-Dyer fudge factor, 364 
Blanksby, P., 487 
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generate ring of integers, 95 
good reduction, 184 
group of a-torsion points, 102, 135 
isogenous curves, 178, 180 
Iwasawa theory, 96 


Index 


j-invariant, 121, 122, 166 
in Q, 138 
is algebraic, 104 
is integral, 140, 147, 151, 176, 447 
j = 0, 102, 107, 177, 180 
j = 1728, 101, 107, 138, 185 
j = —3375, 111 
j = 8000, 110, 180 
lift of Frobenius map, 130, 132, 162 
L-series, 171, 175, 185 
main theorem, 157, 159, 166 
map F': Gal(K/K) > CL(Rx), 112, 
121 


number of points modulo $B, 175 
of degree two, 109, 141, 179 
ordinary reduction, 179 
potential good reduction, 140, 447 
reduction mod $8, 124, 127, 129, 131 
Tate module, 179, 186 
torsion generates abelian extensions, 
108, 135, 148, 149, 166, 168 
transitive action 
of Galois group, 122 
of ideal class group, 100, 113 
twist, 183 
Weber function, 134, 135 
with 7 € R, 142, 179 
Complex structure, 19 
Components of fiber 
group of, 363, 364 
multiplicity of, 313 
number on special fiber, 363, 364, 389 
non-reduced, 313 
of a divisor, 231 
of multiplicity one, 402 
reducible, 313 
singular, 313 
Conductor 
bounded, 404 
constant under isogeny, 404 
divisor of an elliptic surface, 287 
exponent of, 363, 364, 380, 389 
bound for, 385 
computation of, 389, 407 
less than discriminant, 396 
maximum value, 387, 407 
over 2-adic field, 407 
over 3-adic field, 407 
independent of @, 381, 405 
of a Gréssencharacter, 176 
of an abelian extension, 117, 118, 123 
of an elliptic curve, 379-388 
of an order, 178 
of ray class field, 117, 118, 129, 135 
square-free, 388 
Szpiro’s conjecture, 287, 388 
tame part, 380, 381, 405 
wild part, 380, 381, 405 
Conductor-discriminant formula for ellip- 
tic curves, 389 
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Connected component of Néron model, 
326, 361, 401 
Connected fibers, 240, 283, 342, 353 
Contravariant functor, 309 
Correspondence, 68 
Hecke, 68 
See also Hecke operator 
homothety, 68 
on set of lattices, 68, 90 
ring of, 70 
Cox, D., 486 
Cremona, J.E., 389 
Criterion of Néron-Ogg-Shafarevich, 362 
Curve 
See also Riemann surface 
ample divisor on, 257, 264 
automorphism group, 294 
canonical divisor, 27, 87 
differential form on, 207 
exceptional, 236, 344 
height on, 257, 264, 265 
holomorphic differential on, 198 
homology of, 198 
hyperelliptic, 287, 486 
Jacobian variety of, 197, 199, 402 
minimal proper regular model of, 317, 
318, 344, 361 
normal law on symmetric product, 402 
of genus two, 287 
Picard group of, 192, 195, 199, 402 
projective, 27, 87 
proper regular model of, 317 
symmetric product, 229 
uniformizer for, 339 
Cusp, 287, 370 
of X(1), 14 
of H*, 14 
stabilizer of, 15 
Cusp form, 25, 91 
See also Modular function, modular 
form 
action of Hecke operator, 76, 78, 79, 92 
basis of eigenfunctions, 78, 80, 92 
dimension of space of, 31 
discriminant is a, 26, 31 
Fourier coefficients of, 79, 80, 81 
L-series attached to, 83, 85, 93 
simultaneous eigenfunction, 79, 80, 92 
size of Fourier coefficients, 81 
space of (M8..), 31, 78 
spaces with dimension one, 34 
twisted L-series attached to, 94 
twisted by yx, 93 
Cyclotomic extension, unramified, 383 
Cyclotomic field, 95, 128, 151 
is abelian, 108, 128, 151 


Dn, 72, 91, 143 
action of SL2(Z), 72 
DF, 181 
Decomposition group, 116, 331 
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Dedekind, R., 67 
Dedekind domain, 113, 179 
arithmetic surface over, 311 
is regular scheme, 302 
Dedekind 7 function, 65 
transformation formula for, 67, 90 
Dedekind reciprocity, 67, 90 
Dedekind sum s(z, y), 66, 89 
Degree map on divisors, 197 
Degree one prime, in ideal class, 118, 
123, 161 
Degree 
non-square is CM, 143 
of a CM isogeny, 103, 124 
of a divisor, 232, 282 
of isogeny is preserved under reduc- 
tion, 124 
Deligne, P., 61, 81 
Dem’janenko, V.A., 265, 269, 271, 485 
Descent theorem, 230 
Descent, faithfully flat, 338 
Determinant 


homomorphism of group varieties, 292 


kernel of, 292 
Deuring, M., 95, 175 
Diagonal 
height associated to, 285 
is complete intersection, 328 
morphism, 300, 309, 327 
Different, 176, 386 
Differential form, 207 


associated to a modular function, 28, 


31, 87, 91 

divisor of, 27 

holomorphic, 27 

invariant, 43, 183 

of the first kind, 43 

of the second kind, 43 

of the third kind, 43 

on a curve, 198 

on a Riemann surface, 27 

on elliptic curve, 43 

on H, 26 

on X(1), 23 

order of, 27 

regular, 27 

sheaf of relative, 306 
Dimension 

of a point on a scheme, 302 

of a scheme, 302 

of Jacobian variety, 197 


regular scheme of dimension one, 302 


Dirichlet character, primitive, 93 
Dirichlet series 
See also L-series 
analytic continuation, 83, 94 
attached to a cusp form, 83, 85 
attached to a power series, 80 


Euler product expansion, 80, 92, 172, 


173 
functional equation, 83, 94 


Index 


half-plane of convergence, 83, 94 
relations satisfied by coefficients, 80, 
92 


Dirichlet’s theorem on primes in progres- 


sions, 118, 123 


Discrete subgroup 


of Qp, 422 
of Q*, 422 


Discrete valuation ring, 328 


arithmetic surface over, 311 
closed fiber over, 300 
generic fiber over, 300 
Henselian, 330, 401 
Henselization of, 331 
smooth scheme over, 304 
special fiber over, 300 
strict Henselization of, 331 
strictly Henselian, 330, 402 
unramified extension, 399 


Discriminant function, Fourier expansion, 


59, 60, 467, 468 


Discriminant 


Fourier coefficients, 482 
See also Ramanujan 7 function; Ja- 
cobi’s formula 
is eigenfunction for Hecke operators, 
78, 79 
Minimal, 363, 364, 388, 389, 396 
for elliptic surface, 286 
over 2-adic field, 407 
over 3-adic field, 406 
modular, 26, 31, 32, 62, 89, 135 
of a cubic polynomial, 62 
of a Weierstrass equation, 62, 183, 455, 
458, 470, 478 
of isogenous curves, 453 
of Tate curve Eg, 377, 423, 424, 473 
product expansion, 60, 62, 90, 409, 468 
vanishes only at oo, 32 
weight 12 cusp form, 26, 31, 32, 34, 66 


Div 


See Divisor group 


Division polynomial wm, 477 
Divisor class group 


of P!, 271 
of a curve, 
See Picard group 
of a function field, 
See Picard group 


Divisor group, 231 


induced homomorphism, 255 
of a fibered surface, 284 

of a set, 68 

of a variety, 255 

of an arithmetic surface, 339 


Divisor 


algebraic equivalence, 254, 285, 486 
ample, 233, 257, 258 

associated to a morphism, 258 
canonical height, 285 

components of, 231 


Index 


degree of, 232, 282 
exceptional, 344 
fibral, 246, 247, 284, 340 
group of, 231 
height associated to, 256, 265 
horizontal, 340 
integer part of, 87 
intersection pairing, 233, 238 
is difference of very amples, 261, 284 
linear equivalence, 232, 339 
local equation, 232, 233 
minimal discriminant, 286 
numerical equivalence, 486 
of a differential form, 27 
of a section, 245 
of an elliptic function, 45 
on a surface, 231 
on an arithmetic surface, 311, 339 
positive, 285, 341 
principal, 232, 339 
self-intersection, 234, 238, 243, 283, 
342, 349, 351 
Serre’s theorem on ample, 261 
transversal intersection, 232, 282 
very ample, 257, 261, 284 
Weil, 339 
Domain 
of a rational map, 204, 279 
of morphism to group scheme, 327, 
333, 337 
Dominant rational map, 204, 222, 228 
from smooth scheme to proper scheme, 
328 
induces map on function fields, 205 
of elliptic surfaces, 208 
Dual graph of special fiber, 353 
Dual isogeny, 67, 125, 229 
of Néron model, 400 
Duplication formula, 139, 427, 437 
for height, 257 
for local height, 456, 471, 475 
has degree four, 458 
on an elliptic curve, 210, 214, 457, 471 
Dynkin diagram, 353 


Effective bound for S-integer points, 
277, 288 
EFigenfunction 
basis of M$, 80, 92 
for Hecke operators, 77 
normalized, 78, 79, 80, 92, 93 
simultaneous for all T2,’s, 78, 79, 80, 
92, 93 
Hisenstein series, 25, 91 
algebra generated by, 88 
Fox, 58 
Fourier coefficients of, 58 
g2 and g3, 26, 35, 58 
G4 and G¢ algebraically independent, 
88 
G4 and Gg determine A, 35, 88 
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Gor, 25, 31, 32, 55 
is a modular form, 25 
is eigenfunction for Hecke operators, 
78, 93 
normalized, 58 
q-expansion of, 55 
special value of, 178 
value at oo, 25 
Elkies, N., 487 
Elliptic curve 
addition formula, 323 
automorphism group, 183 
bad reduction, 185 
classification of special fibers, 350, 352 
conductor, 363, 364, 388 
endomorphism of degree two, 109, 141 
endomorphism ring R, 98 
endomorphisms are integral, 448 
existence of Néron model, 325, 332, 
335, 337 
field of definition, 37, 38 
field of moduli, 37, 38 
formal group, 276 
Galois conjugate, 131 
good reduction, 184 
group of a-torsion points, 102, 135 
Gréssencharacter of, 165, 168, 174, 175 
height over function field, 212, 213 
homogeneous space of, 199 
invariant differential, 43, 97, 134 
is abelian variety, 196 
is group variety, 291 
isogeny, 67, 182, 183 
j-invariant 
j = 0, 102, 107, 177, 180, 280 
j = 1728, 101, 107, 138, 185, 280 
j = —3375, 111 
j = 8000, 110, 180 
non-integral has End(F) = Z, 447 
é-adic representation, 405, 445 
L-series, 171, 172, 175, 183, 185, 364 
map to P!, 134 
minimal discriminant, 363, 364 
minimal proper regular model, 325, 
332, 335, 337, 350, 352, 361, 400 
multiplication map, 310 
multiplicative reduction, 403 
negation map, 325 
Néron model, 319 
normalized, 97, 131, 134 
of high rank, 272 
ordinary reduction, 184 
over C 
See Elliptic curves over C 
over function field 
See Elliptic surface 
over locally compact field, 476 
over p-adic fields 
See Elliptic curves over p-adic fields 
over R 
See Elliptic curves over R 
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Elliptic curve (continued) 
potential good reduction, 140, 447 
reduction modulo $B, 131 
reduction type, 352, 353, 363, 364 
representation on m-torsion, 108, 383 
supersingular reduction, 184 
translation map, 310 
twists of, 283, 414, 439 
v-adic topology 
See v-adic topology, on an elliptic 
curve 
Weber function, 134, 135 
with complex multiplication 
See Complex multiplication 


Elliptic curves over C, 408-413 


analytic parametrization, 6, 34, 408, 
411, 464, 468 

discriminant, 26, 409, 411, 464, 467 
See also Discriminant 

homology group, 37, 43 

isomorphism class of, 36, 37 

j-invariant, 34, 36, 411 
See also j-invariant 

lattice of, 408, 468, 477 

local height, 464, 466, 468, 477 

periods, 37, 43 

Uniformization Theorem, 6, 34, 35, 88, 
96, 196, 411 

Weierstrass equation, 34, 409, 464 


Elliptic curves over non-archimedean 


fields, 424 


Elliptic curves over p-adic fields, 438-444 


See also Tate curve Eq 

additive reduction, 451 

applications, 445-448 

y-invariant, 439, 441, 451 
not equal one, 444 

local height, 469, 472, 480 

multiplicative reduction, 377, 478 

non-split multiplicative reduction, 378, 
451 

quadratic character, 440, 444 

split multiplicative reduction, 377, 441, 
451, 452, 472, 480 

Tate module of, 452 

torsion points, 383, 445 

uniformization, 141, 423, 441, 444, 473, 
480 

Weil-Chatelet group, 452 


Elliptic curves over R, 413-422 


C isomorphism classes, 414 
y-invariant, 414, 419, 450 

not equal one, 450 
homogeneous spaces, 449 
isomorphic to real Lie group, 420 
Kummer sequence, 449 
number of components, 419 
R isomorphism classes, 414, 416 
sign of the discriminant, 420, 421, 449 
torsion subgroup, 449 
twists, 414, 450 


Index 


uniformization over R, 416 
Weil-Chatelet group, 421, 449 
with CM, 142, 179 


Elliptic function, 39 


See also Weierstrass g function, 
Weierstrass o function, Weierstrass 
¢ function 

divisor of, 45 

factorization as product of o’s, 45 

Laurent series of F(u;q), 49 


Elliptic integral, 178 
Elliptic surface 


See also Fibered surface, surface 
algebraic equivalence, 254 
as subset of P? x C, 200 
associated elliptic curve, 206 
automorphism, 245 
bad fiber, 203 
birational equivalence, 206 
canonical divisor, 249 
canonical height on, 217, 247, 252, 265, 
266, 269, 281, 284, 286 
classical theory, 187 
conductor divisor, 287 
definition of, 202, 203, 205 
divisor Dp, 247 
divisor of a section, 245 
dominant rational map, 208, 222, 228 
fiber, 201, 202 
fiber product, 211, 301 
field of definition, 203 
of section, 189, 282 
generic fiber, 206 
good fiber, 203 
group of sections, 202, 210, 298 
defined over k, 203, 210 
height on, 212, 213, 265 
infinitely many points of bounded 
height, 220, 222 
is a C-scheme, 298 
j-invariant 
is algebraic function, 280 
constant, 286 
Kummer sequence for, 193 
lattice structure, 254 
lower bound for height, 287 
map on Pic is injective, 253 
minimal discriminant divisor, 286 
Mordell-Weil theorem, 230, 276, 279 
multiple fibers, 203 
multiplication-by-n map, 266 
Néron-Severi group, 254 
non-singular fiber, 286 
non-split, 218, 247 
with zero discriminant, 286 
one-cocycle on sections, 284 
one-parameter family, 188 
ord;(x) is bounded, 275 
over non-perfect field, 187 
points not a group, 211 
rational map between, 206 


Index 


S-integral points on, 275, 277, 288 
section, 202 
of order eleven, 190, 279 
of order five, 278 
of order seven, 278 
singular fiber, 203 
specialization map, 271 
is injective, 271, 281 
of height, 265, 266, 269, 281, 286 
split, 220, 222, 228, 231, 271, 280, 281 
over P!, 286 
subgroup Eo(K), 251, 284, 288 
torsion subgroup, 288 
translation map, 245, 248, 251 
weak Mordell-Weil theorem, 191-195, 
230 
zero section, 202 
Endomorphism 
field of definition of, 106 
of degree two, 109, 141, 179 
of an elliptic curve, 448 
Endomorphism ring 
commutator subring, 129 
elliptic curve with given, 121 
normalized isomorphism, 97, 129, 131, 
134 
quaternion algebra, 130 
reduction modulo $B, 129 
Eq, See Tate curve Eg 


Etale morphism, 306 


Etale topology, 332 
Eta function (7) 
See Quasi-period homomorphism, 
Dedekind 7-function 
Euclidean structure on Mordell-Weil 
group, 247 
Euler product, 80, 92, 172, 173 
Excellent scheme, 311 
Exceptional curve, 236, 244 
Exceptional divisor, 344 
Castelnuovo’s criterion, 344, 352 
on an arithmetic surface, 344 
Exponent of the conductor, 380, 389 
bound for, 385 
computation of, 389, 407 
independent of @, 381, 405 
less than discriminant, 396 
Exponential function, special values, 151 
Exponential map on multiplicative 
group, 151 
Extended upper half plane H*, 14 
action of P'(1), 14, 85 
cusps of, 14, 85 
is a Hausdorff space, 16 
topology on, 16, 85 


Faithfully flat 

descent, 338 

strict Henselization is, 338 
Faltings, G., 271 
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Fermigier, T., 272 
Fiber 
archimedean, 345 
closed, 300 
generic, 300 
multiplicity of components, 313 
of a scheme over a point, 300 
of an arithmetic surface, 313 
of an elliptic surface, 201 
singular point, 314 
smooth, 397 
special, 300 
Fiber product, 299 
diagonal, 300, 309, 327 
graph in, 300 
group operation on, 211 
of elliptic surface, 211, 301 
universal property, 299 
Fibered surface, 236 
See also Surface, elliptic surface 
connected fibers, 240, 283, 342 
fiber of, 236 
genus at least one, 244 
fibral curve, 237 
fibral divisor, 237 
fibral homomorphism, 237 
fibral intersections, 238 
horizontal curve, 237 
horizontal divisor, 237 
map on Picard group, 284 
is injective, 253 
minimal, 244 
multiplicity of fibers, 237 
negative semi-definite intersection pair- 
ing, 238 
relatively minimal, 243 
Fibral curve, 237 
Fibral divisor, 237, 246, 247, 284 
group of, 340, 344 
intersects fiber trivially, 238 
on an arithmetic surface, 340 
self-intersection, 238, 342, 351 
Field 
locally compact, 476 
non-perfect, 187 
of characteristic zero, 187, 189 
perfect, 330, 332 
totally imaginary, 116 
Field of definition 
for E(C(T)), 282 
of an abelian variety, 196 
of an elliptic curve, 37, 38 
of an elliptic surface, 203 
of Jacobian variety, 199 
of section to elliptic surface, 189, 282 
Field of moduli of an elliptic curve, 37, 
38 
Finite group scheme, 397 
Finite order elements in Picard group, 
192, 195, 199 
Finite type scheme, 311 
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First kind differential form, 43 
Fischer-Griess monster group, 61 
Flat family, arithmetic genus in, 343 
Flat morphism, 304, 397 
Flat scheme, 311, 345 
Formal Dirichlet series, 80 
Formal group 
of an elliptic curve, 276, 381 
of Tate curve Eq, 431 
Formal logarithm, 150, 182 
Formal multiplicative group, 149, 431 
Fourier analysis, 173 
Fourier coefficients 
See also g-expansion 
action of Hecke operator, 76, 78, 79 
integer, 145 
of a simultaneous eigenfunction, 79, 80 
relations satisfied by, 79, 80 
size of, 81, 82, 92 
Fourier expansion 
See q-expansion 
Fractional ideal 
is a lattice, 99 
of quadratic imaginary field, 85 
Free module of rank one, 102, 138, 186 
Frey, G., 453 
Fricke, M., 95 
Frobenius map 
action on Tate module, 172 
automorphism, 116, 128, 129, 130, 132, 
161 
characteristic polynomial, 172 
lift to characteristic zero, 130, 132, 162 
of varieties, 132 
Fudge factor, 364 
Function field 
birational equivalence, 205 
canonical height is rational, 219, 247 
elliptic curve over 
See Elliptic surface 
finite subgroup of, 195 
height on, 212 
elliptic curve over, 212, 213 
ideal class group, 192 
infinitely many elements of bounded 
height, 213 
Kummer theory, 191 
local heights, 212 
map induced by dominant rational 
map, 205 
Mordell- Weil theorem, 230, 279 
S-integers, 275 
Szpiro’s conjecture, 287, 388 
unit group of, 191 
valuation on, 194 
Functional equation 
of Dirichlet series attached to cusp 
form, 94 
of L-series, 176, 184 
of L-series attached to cusp form, 83, 
94 


Index 


sign of, 176 
for O(u,q), 412, 429, 474 
for X(u,q), 425 
for Y(u,q), 425 
Functor 
contravariant, 299, 309 
defined by a group scheme, 309 
defined by an S-scheme, 298, 306 
Jacobian, 198 
Functoriality of height function, 256 
Fundamental domain (F) for action of 
SL2(Z) on H, 10, 14, 82, 92 
Fundamental domain, for C/A, 41, 71 


Gor, See Eisenstein series 
y-invariant 
in characteristic two and three, 451 
of an elliptic curve, 439 
defined over R, 414, 419 
I(1), See Modular group (1) 
To(N), 87 
T(N), 87 
TN), 86 
I'(s), See Gamma function 
Galois action 
on series, 424, 428, 451 
on special fiber, 353 
Galois cohomology 
of a cyclic group, 421 
of a discrete group, 428 
Galois conjugate of a CM elliptic curve, 
131 
Galois group 
decomposition group, 331 
higher ramification group, 379 
inertia group, 120, 149, 331 
of Hilbert class field, 118 
Galois theory, 37 
Gamma function, 83, 176 
incomplete, 93 
Gauss, K.F., 141 
Gauss sum, 93 
Gaussian integers, 178 
Gel’fond-Schneider theorem, 108, 142 
General linear group, 291 
centralizer of a subgroup, 179 
determinant, 292 
elements of finite order, 403, 404 
formal logarithm, 150, 182 
rational points of, 292 
Generic fiber, 300 
empty, 401 
of an arithmetic surface, 311 
of an elliptic surface, 206 
over a DVR, 300 
Generic point, 304, 328 
of a scheme, 300 
of Spec(R), 300 
Genus 
arithmetic, 342, 351 
in flat family, 343 


Index 


of a smooth curve, 27, 87 
of the modular curve X(1), 20, 21 
two, special fiber of curve of, 355 
zero, 343, 352 
GLn, See General linear group 
Global sections, 397 
Good fiber, 203 
Good reduction, 184, 287, 321, 329, 383, 
388 
conductor, 381 
Groéssencharacter is unramified, 168 
potential, 140, 148, 151, 176, 383 
torsion for, 453 
Graded algebra, 27, 88 
Graph of a morphism, 300 
Green’s function, 477 
Group 
algebraic 
See Group variety 
pro-p, 150, 182 
sporadic, 61 
topological, 85 
Group chunk, 198, 334 
Group cohomology, 193 
of a cyclic group, 421, 449 
Group law 
induced by normal law, 334, 337 
is normal law, 334 
Group of components 
of a group variety, 292 
of Néron model, 350, 362, 363, 364, 
379, 402, 486 
of special fiber, 363, 364 
Group of sections to an elliptic surface, 
202, 203, 210, 298 
Group scheme, 306, 398 
action, 321, 326, 400 
additive group, 307, 397, 398, 399 
associated functor, 309 
associated to normal law, 334, 337 
associativity, 307 
diagonal morphism, 309 
extension of group law, 332, 335 
finite, 397 
identity component of special fiber, 
326, 361 
identity section, 306, 326 
in characteristic p, 397 
inverse map, 306 
local ring along identity, 327 
morphisms to, 327, 333, 337 
multiplication map, 310, 398 
multiplicative group, 308, 397, 398, 
399 
Néron model, 319 
of roots of unity, 397, 398 
proper, 327 
set of T-valued points, 309, 397 
smooth part of Weierstrass equation, 
321, 362, 378, 399, 400 
smooth Weierstrass equation, 329 
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translation map, 310, 336, 337, 400 
Group variety, 291 

abelian, 196 

additive group, 291, 293, 398 

algebraic, 115 

defined over K, 291 

determinant homomorphism, 292 

elliptic curve, 291, 293, 398 

general linear group, 291 

homomorphism, 292 

identity component, 115, 292 

irreducible components, 292 

is a group scheme, 307 

is non-singular, 292 

Jacobian, 402 

linear group, 292, 396 

multiplicative group, 291, 293, 398 

of dimension one, 293, 398 

orthogonal group, 396 

rational points of, 292 

special fiber of Néron model, 361 

special linear group, 292 

special orthogonal group, 396 

symplectic group, 396 
Gréssencharacter, 156, 165, 168, 173, 184 

conductor of, 176 

for 7 = 0 curve, 177 

L-series, 171, 173, 175, 185 

of twisted curve, 183 

on Q, 156 

reduction modulo $B, 174 

unramified, 168, 173, 174 


Half-plane of convergence 
of L-series attached to cusp form, 83 
of L-series attached to elliptic curve, 
183 
Hardy, G.H., 61 
Hasse, H., 95 
Hausdorff space, 16, 18, 19, 85 
Hecke, E., 81, 173, 484 
Hecke algebra, 70 
is commutative, 71 
of [(1), 70 
Hecke L-series, 173 
Hecke operator, 67-74 
See also Hecke algebra 
action on a cusp form, 76, 78, 79, 92 
action on a lattice function, 74 
action on a modular form, 76 
action on a modular function, 76, 79, 
91 
action on Fourier coefficients, 76, 78, 
79 
as shifting operator, 76 
commutativity of, 71 
definition of, 68 
discriminant is eigenfunction, 78 
eigenfunction for, 77 
explicit formula for, 71, 73, 74, 90 
recursive formula for, 90 
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Hecke operator (continued) 
relations satisfied by, 68, 79, 90 
self-adjoint, 92 
simultaneous eigenfunction for all, 78, 
79, 80, 92, 93 
T(p), 74, 7 
Heegner, 141 
Height 
additivity, 256, 261 
and linear equivalence, 256 
associated to a divisor, 256, 265 
associated to a morphism, 258 
associated to a positive divisor, 285 
associated to an ample divisor, 257 
associated to diagonal of curves, 285 
associated to exceptional curve, 285 
canonical, 217, 247, 265, 266, 269, 281, 
286 
canonical 
See Canonical height 
duplication formula, 213, 257, 280 
explicit O(1) estimate, 280 
finitely many points of bounded, 257, 
265 
for algebraically equivalent divisors, 
285 
functoriality, 256 
geometric transformation properties, 


213 

infinitely many elements of bounded, 
213, 220 

Néron-Tate, 217, 247, 265, 266, 269, 
281, 286 


See also Canonical height 
normalization, 256 
on a function field, 212 
on a variety, 256 
on an elliptic surface, 212, 213, 265 
on curves, 257, 264, 265 
on projective space, 255 
parallelogram law, 213 
quasi-parallelogram law, 280 
regulator, 273 
specialization of, 265, 266, 269, 281, 
286 
sum of local, 212 
Height Machine, 256 
verification of, 262 
standard properties, 267, 268 
Hensel’s lemma, 330, 331, 401, 443 
multi-variable version, 401 
Henselian ring, 330, 401 
complete ring is, 330 
strictly, 330, 402 
surjectivity of reduction, 330, 333, 337, 
401 
Henselization 
fraction field, 332 
of a DVR, 331 
strict, 337 
universal mapping property, 332, 401 
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Hermitian inner product, 92 
Higher ramification group, 379 
index function for, 405 
inertia group, 380 
is normal, 380 
over 2-adic field, 404 
over 3-adic field, 404 
Hilbert class field, 95, 118, 130, 132, 142, 
181 
abelian extension of, 134 
Artin map for, 118 
Galois group of, 118 
generated by j(/), 121, 122, 166 
of Q(./—15), 180 
of Q( /—23), 180 
of quadratic imaginary field, 121, 122 
Hilbert irreducibility theorem, 272 
Hilbert theorem 90, 193, 421 
Hindry, M., 277, 278, 486, 487 
Holomorphic differential form, 27, 207 
on a curve, 198 
Homogeneous space of an elliptic curve, 
199 
Homology group, 37, 43 
of a curve, 198 
Homomorphism 
of groups, 427 
of group varieties, 292 
Homothetic lattices, 6, 9, 14, 37, 101, 
103, 161 
Homothety operator, 68, 90 
relations satisfied by, 68 
Horizontal curve, 237 
intersection with, 283 
Horizontal divisor, 237 
on an arithmetic surface, 340 
Hurwitz, A. 101, 294, 485 
Hyperelliptic curve 
integer points on, 277 
Jacobian variety, 199, 287, 486 
of genus two, 287 
over function field, 277 
Picard group, 287 


Ideal 
of an idele, 119, 152, 159 
principal, 132 
Ideal class group, 99, 180 
See also Hilbert class field 
acts on ELL(RK), 99 
acts on ELL(R») transitively, 100, 
113 
algorithm to compute, 85 
class contains degree one primes, 118, 
123 
is finite, 86 
is Picard group, 192 
of a function field, 192 
of quadratic imaginary field, 85 
Idele 
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ideal of, 119, 152, 159 
multiplication by, 159, 170 
Idele group, 119 
characterization of ray class fields, 120, 
162 
class field theory using, 118-120 
contains K*, 119 
contains K¥, 119 
norm map, 119 
topology on, 119 
Identity component 
of a group variety, 115, 292 
of Néron model is Weierstrass equa- 
tion, 362, 378 
Identity element Néron model, 326 
Identity section of a group scheme, 306 
Image of a rational map, 204, 279 


Incidence matrix, 240, 241, 243, 283, 350, 


402, 403, 486 
Incomplete gamma function, 93 
Index function for higher ramification 
groups, 405 
Inert prime, 184 
Inertia group, 120, 149, 169, 331, 380, 
445 
absolute, 380 
Inner product 
Hermitian, 92 
Petersson, 92 
positive definite, 92 
Inseparable isogeny, 127 
Integer points 
effective methods, 277, 288 
on an elliptic surface, 275 
Integers of a function field, 275 
Integral scheme, 311 
Integral 
elliptic, 178 
not path independent, 198 
Intersection 
self, 234, 238, 243, 283, 342, 349, 351 
transversal, 232, 282 
Intersection index 
Arakelov, 344 
computation of, 283 
example, 349 
linear equivalence, 341 
local, 233, 283 
on pis 340, 341 
on an arithmetic surface, 339, 341 
sum of local indices, 234 
symmetric, 341 
Intersection pairing, 233 
Arakelov, 344 
incidence matrix, 240, 241, 243, 283 
on an arithmetic surface, 341, 342, 353 
on fibered surface, 238 
symmetric, 341 
Invariant differential, 43, 97, 134, 183 
Inverse on a group scheme, 306 
Irreducibility theorem of Hilbert, 272 
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Irreducible topological space, 280 
Isogeny, 67, 182 
between CM curves, 178, 180 
between Tate curves, 453 
comparison of discriminants, 453 
conductors are equal, 404 
degree preserved under reduction, 124 
dual, 67, 125, 229, 400 
equations for, 183 
field of definition of, 105 
is a homomorphism, 396 
lift of Frobenius map, 130, 132, 162 
of Néron model, 400 
purely inseparable, 127 
reduction mod ‘B, 124, 127, 129 
Isomorphism birational, 204 
Iwasawa module, 186 
Iwasawa theory, 96 


j-invariant 

classifies elliptic curves, 36 

CM in Q, 107, 138 

computation of, 142, 181 

divisibility properties of Fourier coeffi- 
cients, 60 

elliptic surface with constant, 286 

Galois conjugate of, 112 

generates Hilbert class field, 121, 122, 
166 

growth properties of Fourier coeffi- 
cients, 61 

in Z, 141 

integral, 383, 405 

integrality properties, 140, 151, 447 

j = 0, 102, 107, 180, 280 

j = 1728, 101, 107, 138, 280 
Groéssencharacter, 185 
number of points modulo p, 185 

j = —3375, 111 

j = 8000, 110, 180 

map X(1) — P!(C), 23, 34, 36, 88 

modular, 34, 88, 121, 143 

modular function of weight zero, 34, 
112, 144 

non-integral, 382, 405 

of an elliptic surface, 280, 288 

of CM curve is algebraic, 104 

of CM curve is integral, 140, 147, 151, 
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q-series, 59, 60, 121, 141, 142, 144, 
145, 146, 410 


rationality properties, 37, 104 
singular, 104 
supersingular, 104 
Tate curve Eg, 423, 425, 438, 473 
transcendental values, 108 
valuation of, 362, 379 
Jacobi, C., 62 
Jacobi sum, 177 
Jacobi’s formula, 60, 62, 65, 90, 409, 468 
Jacobian matrix, 330, 401 
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Jacobian variety, 197, 199, 402 
canonical height on, 271 
construction of, 197 
curve of genus two, 287 
dimension of, 197 
field of definition, 199 
is a functor, 198 
Néron model, 350 
of a homogeneous space, 199 
of a hyperelliptic curve, 199 
of genus one curve, 197 
of genus zero curve, 197 
of hyperelliptic curve, 486 

Jordan normal form, 179 

Jugendtraum, 96 


Kernel of Artin map, 117, 118, 179 
Kodaira, K., 287, 352, 486 
Kodaira-Néron classification of fibers, 
350, 352 
Kodaira notation for special fiber, 353 
Kodaira reduction type, 352, 353, 363, 
364 

Kouya, T., 272 
Kramer, K., 385 
Kraus, A., 486 
Kronecker, L., 95 
Kronecker congruence relation, 182 
Kronecker Jugendtraum, 96 
Kronecker limit formula, 484 
Kronecker-Weber theorem, 95, 129 
Krull dimension, 301, 302 
Kummer extension, 383, 446 
Kummer sequence 

for an elliptic surface, 193 

for fields, 193 
Kummer theory for function fields, 191 
Kuwata, M., 189, 485 
Kuyk, W., 389 


L-series 

See also Dirichlet series 

analytic continuation, 83, 93, 94, 173, 
176 

attached to a cusp form, 83, 85, 93 

attached to a modular form, 93 

attached to a power series, 80 

attached to a twisted cusp form, 94 

elliptic curve, 364 

Euler product expansion, 80, 92, 172, 
173 

for 7 = 0 curve, 178 

functional equation, 83, 93, 94, 173, 
176, 184 

half-plane of convergence, 83, 172, 183 

Hecke, 171, 173, 175, 185 

integral representation, 84, 85, 94 

leading coefficient, 364 

local of F at p, 171, 184, 185 
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of an elliptic curve, 171, 172, 175, 183, 
185 
relations satisfied by coefficients, 80, 
92 
residue of, 93 
special values of, 93 
£-adic cohomology, 172 
Lang, S., 231, 275, 285, 485 
Lang’s conjecture, height lower bound, 
287 
Laska’s algorithm, 364 
Laska, M., 364 
Lattice 
associated to a Weierstrass equation, 
35, 88, 89 
Euclidean, 254 
Hecke correspondence on, 68, 90 
homothetic, 6, 9, 14, 37, 101, 103, 161 
homothety correspondence on, 68, 90 
normalized, 47 
normalized basis, 7 
of an abelian variety, 196, 198 
of an elliptic curve, 408 
oriented basis, 6, 9, 71, 72, 73, 89 
set of all (£), 6, 36, 68, 74 
sublattice, 67 
sublattice of index n, 71, 72, 73 
Lattice function, 74 
associated to a modular function, 74, 
79 
Laurent series 
for F(u;q), 49, 51 
for g, 39, 51 
for ¢, 39 
p-adic, 429 
Lefschetz principle, 199, 463 
Legendre relation, 41, 465, 469 
Lehmer, D.H., 61 
Lehner, J., 60 
L’Hopital’s rule, 45 
Lichtenbaum, S., 317, 338, 344 
Lie group 
complex, 48 
real, 420 
Limit formula of Kronecker, 484 
Linear equivalence 
and height functions, 256 
intersection index, 341 
of divisors on a surface, 232 
Linear fractional transformation, 294 
Linear group, 292, 396 
Linearly equivalence, 339 
Lipman, J., 317 
Littlewood, J., 61 
Liu, Q., 390 
Local L-series of EF, 171, 184, 185 
Local class field theory, 140, 148, 149, 
452 
Local degree nz, 461 
Local equation for a divisor, 232, 233 
Local field 
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Artin character, 405 
different, 386 
higher ramification group, 379 
index function, 405 
inertia group, 380 
maximal unramified extension, 381 
Swan character, 405 
Local height function, 429 
algorithm to compute, 478, 479 
associated to v, 461 
comparison with naive height, 478 
differential equation for, 477 
duplication formula, 456, 475 
existence of, 455-461, 476 
archimedean absolute values, 467 
non-archimedean absolute values, 
470, 475 
integral over E:(C), 477 
is a Green’s function, 477 
is continuous, 456 
is invariant under field extension, 456, 
476 
is well-defined, 456, 470 
is zero for almost all v, 461 
lower bound for, 480 
multiplication formula, 477 
on Eo(K), 469, 478 
on Tate curve Eq, 473, 475 
over C, 464, 466, 477, 479, 480 
over non-archimedean fields, 469-476 
over p-adic fields, 469-476, 478, 479, 
480 
over R, 479 
quasi-parallelogram law, 467, 476 
triplication formula, 463 
Local intersection index, 233 
on Ph 340, 341 
on an arithmetic surface, 339 
Local Néron height function 
See Local height function 
Local parameter, 19, 20, 29, 30, 85 
Local ring 
is discrete valuation ring, 232 
of a curve on an arithmetic surface, 
311, 313, 339 
of a surface at a curve, 231 
of a surface at a point, 231 
of arithmetic surface, 315 
of group scheme along identity, 327 
regular, 302, 370, 397 
regular of dimension two, 315 
Localization, 179 
Locally compact field, 476 
Lockhart, P., 385 
Logarithm 
formal, 150, 182 
principal branch, 44, 54, 56, 66 


Magic, 48 
Main theorem of complex multiplication, 
157, 159, 166 
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Manin, Yu., 247, 265, 269, 271, 275, 485 
Mason, R.C., 277, 278 
Masser, D., 278 
Matrix 
incidence, 350, 402, 403, 486 
Jacobian, 330, 401 
Maximal abelian extension 
of exponent m unramified outside S, 
191 
of quadratic imaginary field, 135 
Maximal unramified extension 
abelian 
See Hilbert class field 
of a local field, 381 
Meromorphic function 
j is a, 23, 35 
on X(1), 23, 35 
on P!, 35 
on H, 24 
Mestre, J.-F., 272, 486 
Minimal discriminant, 363, 364, 388, 389 
greater than conductor, 396 
of an elliptic surface, 286 
of isogenous curves, 453 
over 2-adic field, 407 
over 3-adic field, 406 
Szpiro’s conjecture, 287, 388 
valuation one, 369, 399 
valuation two, 399 
Minimal fibered surface, 244 
birational map is morphism, 244 
Minimal proper regular model 
components of special fiber, 389 
connected component, 326, 361, 401 
identity component of special fiber, 
326, 361 
number of components of special fiber, 
363, 364 
of a curve, 317, 318, 344 
of an elliptic curve, 325, 332, 335, 337, 
350, 361, 400 
reduction type, 352, 353, 363, 364 
special fiber, 350, 352, 403 
Minimal surface, relatively, 235 
Minimal Weierstrass equation, 321, 403, 
478, 480 
algorithm, 364, 403 
for p > 3, 406 
for p > 5, 405 
Modular j-invariant 
See j-invariant 
Modular curve X(1), 14 
affine part Y(1), 14 
complex structure on, 16, 20, 26, 28, 
35 
cusp, 14 
differential forms on, 23, 28, 87 
has genus zero, 20, 21 
is a Hausdorff space, 18, 19 
is a Riemann surface, 20, 32 
is compact, 19 
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Modular curve X (1) (continued) 
is connected, 21 
j-function on, 23, 34, 36 
local parameter, 20, 21, 29, 30, 85 
meromorphic functions on, 23, 35 
open cover of, 20 
order of a differential form, 28 
projection is open map, 19 
topology on, 16, 18, 19 
Modular curve X1(11), 190, 279 
Modular discriminant 
See Discriminant 
Modular form, 25, 91See also Modular 
function, cusp form 
action of Hecke operator, 76 
dimension of space of, 31, 87 
Eisenstein series is a, 25, 31, 32 
Fourier coefficients of, 82, 92 
L-series attached to, 93 
size of Fourier coefficients, 82, 92 
space of (Mp;), 31, 87 
space of is generated by G4, G6, 88 
spaces with dimension one, 33 
Modular function, 24, 91 
See also j-invariant; modular form; 
cusp form 
jis a, 34 
action of Hecke operator, 76, 79, 91 
associated differential form, 28, 31, 87, 
91 
associated lattice function, 74 
eigenfunction for Hecke operator, 77 
formula for order, 30 
Fourier series, 24, 76, 145 
holomorphic at oo, 24 
meromorphic at oo, 24 
of weight zero, 34, 35, 182 
order at oo, 24 
value at oo, 24, 32 
weight of, 24 
why even weight, 24 
Modular group I'(1), 6-14 
See also Action of SL2(Z) on H 
generated by S and T, 14, 24, 62, 85 
is a free product, 14, 85 
Modular polynomial 
examples, 148 
Fy, 144, 146, 181, 182 
@,, 181, 182 
Hn, 144, 146 
size of coefficients, 181 
Monster group, 61 
Montgomery, H., 487 
Mordell, L.J., 61, 79 
Mordell conjecture, 271 
Mordell-Weil group 
Euclidean structure, 247 
lattice structure, 254 
Mordell-Weil theorem 
bound for rank, 279 
for abelian varieties, 485 
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for elliptic surfaces, 276 
for function fields, 230, 279 
for split elliptic surfaces, 231, 281 
ineffective, 277 
over finitely generated fields, 279 
relative, 231, 281 
weak, 191-195, 230 
Morphism 
birational, 235 
diagonal, 300, 309, 327 
divisor associated to, 258 
étale, 306 
flat, 304, 397 
graph of, 300 
of S-schemes, 297 
proper, 305 
separated, 305 
smooth, 304, 305, 306, 320 
universally closed, 305 
Multiple fibers, 203 
Multiplication map 
continuous for v-adic topology, 460 
on elliptic curve, 310 
on group scheme, 310, 398 
Multiplication by an idele, 159, 170 
Multiplication-by-n map on an elliptic 
surface, 266 
Multiplication-by-x map on Q/Z, 152, 
153 
Multiplicative group, 128, 151, 398, 399 
exponential map, 151 
formal, 149, 431 
over a field, 291, 293, 398 
over R, 308 
over S, 308 
over Z, 308 
rational points of, 292 
S-valued points, 397 
Tate module of, 382 
torsion subgroup, 151 
Multiplicative reduction, 287, 369, 388, 
399 
conductor, 381 
Néron model, 403 
non-split, 171, 378 
split, 171, 362, 377, 379 
torsion for, 453 
Multiplicity of fibral components, 313 


Nagao, K., 272 

Nakai-Moishezon criterion, 258, 261, 283, 

284 

Nakayama’s lemma, 103 

Namikawa, Y., 355, 486 

Natural map, 197, 198 

Negation map, on an elliptic curve, 325 

Néron, A., 231, 265, 272, 352, 454, 455, 
485 

Néron height function 

See Local height function 
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Néron-Kodaira classification of fibers, 
350, 352 
Néron mapping property, 319, 329, 333, 
338 
Néron model, 319, 361 
addition formula on Weierstrass equa- 
tion, 323 
automorphism of, 400 
connected component, 326, 361, 401 
dual isogeny, 400 
existence, 325, 332, 335, 337 
extension of group law, 332, 335 
generic fiber, 319 
group of components, 350, 362, 363, 
364, 379, 402, 486 
group of sections, 319 
group scheme action, 326, 400 
identity component is Weierstrass 
equation, 362, 378 
identity component of special fiber, 
326, 361 
identity section, 326 
isogeny, 400 
multiplicative reduction, 403 
negation map on Weierstrass equation, 
325 
Néron mapping property, 319, 320 
not proper, 319 
of Jacobian variety, 350, 486 
of Tate curve Eg, 435 
over a strictly Henselian ring, 402 
over strictly Henselian ring, 332, 335 
special fiber, 350, 361, 400, 435 
subgroup scheme, 401 
translation map, 400 
uniqueness, 319 
unramified base extension, 320 
Weierstrass equation, 321, 329, 362, 
369, 378, 399, 400 
Néron notation for special fiber, 353 
Néron-Ogg-Shafarevich criterion, 140, 
151, 148, 170, 362, 381, 383 
Néron-Severi group 
of an elliptic surface, 254 
rank, 254 
Nice scheme, 311, 399 
Node, 287, 370 
Non-linearity, textual, 217 
Non-perfect field, 187 
Non-singular arithmetic surface, 311 
Non-singular point 
on fiber of arithmetic surface, 315, 351, 
399 
on reduction, 362, 379 
on scheme 
See Regular point 
Non-singular scheme 
See Regular scheme 
Non-split multiplicative reduction, 378, 
399 
Norm map 
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compatibility with reciprocity map, 
120 
of fields, 444 
on ideles, 119 
Normal law, 334, 335 
associativity, 334, 335 
induces group law, 334, 337 
on symmetric product of a curve, 402 
Normal scheme, 311 
principal divisor on, 328 
Normalized basis, 7 
Normalized eigenfunction, 78, 79, 80, 92, 
93 
See also Figenfunction 
Normalized elliptic curve (E,[-]), 97, 
131, 134 
Normalized lattice, 47 
Nullstellensatz, 259 
Number field 
abelian extension, 128 
See also Class field theory 
Artin map, 117, 118, 154 
decomposition group, 116 
ideal class group, 99, 180 
idele group, 119 
maximal abelian extension, 120, 121 
of class number one, 138, 141 
Q(i), 138 
unit group of, 191 
Zy-extension, 186 
Numerical equivalence, 486 


Ogg, A., 355, 389, 487 
Ogg’s formula, 363, 364, 387, 389, 396, 
407 
One-cocycle on elliptic surface, 284 
One-parameter family of elliptic curves, 
188, 200 
Operator 
Hecke 
See Hecke operator 
homothety, 68, 90 
Order 
at oo of a modular function, 24 
conductor of, 178 
non-maximal, 160, 178, 180 
of a differential form, 27 
Ordinary reduction, 179, 184 
Oriented basis, 6, 9, 71, 72, 73, 89 
Orthogonal group, 396 
special, 396 
Osterlé, J., 278 


p-adic analysis, 424, 429 
p-adic field, 423 

squares in, 442 
p-adic period, 364 
p-adic Tate module, 186 
p-primary component, 157 
gy, See Weierstrass ¢ function 
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Parallelogram law, 218 
for height, 213 
Path independent integral, 198 
Perfect field, 330, 332 
Period map, 43 
Period, p-adic, 364 
Periodic second Bernoulli polynomial, 
480 
Periods, 37 
Petersson, H., 61, 80 
Petersson inner product, 92 
Hecke operators are self-adjoint, 92 
Picard group 
See also Jacobian variety 
divisor class of degree zero, 197 
elements of finite order, 192, 195, 199 
induced homomorphism, 255 
is ideal class group, 192 
of P?, 232, 282 
of P”, 282 
of a curve, 402 
of a fibered surface, 284 
of a surface, 232 
of curve of genus two, 287 
of hyperelliptic curve, 287 
Points 
of a group scheme, 309, 397 
of a scheme, 298, 309, 397 
Polar divisor on normal scheme, 328 
Positive divisor, 341 
associated height, 285 
Potential good reduction, 140, 148, 151, 
176, 383, 447 
Power divisor function o, 
See ox 
Prime ideal 
in arithmetic progression, 118, 123 
in ideal class, 118, 123 
inert, 184 
split, 184 
splits completely, 117, 118, 179 
Primitive Dirichlet character, 93 
Primitive root of unity, 21, 23, 29, 154 
Principal branch 
of logarithm, 44, 54, 56, 66 
of square root, 65, 67 
Principal divisor, 232 
on an arithmetic surface, 339 
Principal ideal, 132 
congruent to 1 modulo c, 117 
Pro-finite topology, 120, 182 
Pro-p group, 150, 182 
Product 
fiber, 299 
restricted, 119 
Product expansion of sine function, 56, 
58 
Projection morphism on fiber product, 
299 
Projective closure of a scheme, 371 
Projective line 
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arithmetic genus, 343, 352 
arithmetic surface, 312 
proper over R, 312 
smooth over R, 312 
Projective plane, Picard group of, 232, 
282 
Projective scheme, is proper, 303, 322 
Projective space 
height on, 255 
over a ring, 298 
over a scheme, 301, 303, 322 
Picard group of, 282 
Proper arithmetic surface, 311, 316, 317, 
351 
Proper group scheme, 327 
Proper morphism, 297, 305 
intuitive definition, 301 
valuative criterion, 303, 317, 329, 333 
Proper regular model 
minimal, 325, 332, 335, 337, 389, 400 
of a curve, 317 
Proper scheme, 312, 321, 329, 399 
group, 327 
map from smooth scheme, 328 
projective scheme is, 303, 322 
Proper S-scheme, 301, 305 


q-expansion 
See also Fourier coefficients 
of Bernoulli polynomial, 480 
of discriminant A, 59, 409, 467, 468 
of gg and gz, 409 
of Gox, 55 
of j-invariant, 59, 144, 145, 146, 410 
of a modular function, 24, 55, 182 
of g and g’, 47, 50, 409 
of Weierstrass o function, 53, 467 
of Weierstrass ¢ function, 52 
Quadratic character, 440, 444 
Quadratic field, non-maximal order, 178, 
180 
Quadratic form, canonical height, 454 
Quadratic imaginary field 
Q(i), 138 
abelian extension of, 128, 135 
algorithm to compute ideal class 
group, 85 
Hilbert class field of, 95, 121, 122, 142, 
166, 181 
ideal class group, 85 
maximal abelian extension, 135 
of class number one, 138, 141 
ring of integers, 85 
Quadratic transformation, 244 
Quasi-parallelogram law, 213, 467, 476 
explicit O(1) estimate, 280 
Quasi-period homomorphism, 41, 43, 44, 
52, 53, 65, 89, 412, 464, 465, 466 
See also Legendre relation 
formula for, 41 
Quasi-periodicity of ¢, 40 
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Quaternion algebra, 130 

Quaternion group, 404 

Quotient topology, 85 
on X(1), 18, 19 


Ramanujan , S., 61, 79 
Ramanujan conjecture, 61, 81 
Ramanujan 7 function, 59, 60, 482 
growth properties, 61 
multiplicative identities, 61, 79 
special values, 60 
Ramification index, 283 
Ramification, wild, 380 
Rank 
high over Q, 272 
of Néron-Severi group, 254 
upper bound, 279 
Rank one module, 102, 138, 186 
Rational map, 203 
birational isomorphism, 204 
blowing-up, 204, 235 
domain of definition, 204, 279 
dominant, 204, 222, 228 
from smooth scheme to proper scheme, 
328 
image, 204 
is algebraic, 204, 279 
is irreducible, 204, 280 
naive definition, 204 
of elliptic surfaces, 206 
of varieties, 132 
Ray class field, 117, 118, 129, 135, 161 
conductor of, 117, 118, 129, 135 
idelic characterization, 120, 162 
modulo unit ideal, 118 
of Q(i), 138, 180 
of Q(V—3), 180 
primes splitting completely, 118 
Raynaud, M., 350, 486 
Real Lie group, 420 
Reciprocity 
Artin, 117 
Dedekind, 67, 90 
map, 120, 152, 159, 165, 166, 168, 169, 
174 
Reduction map 
injective on torsion, 162 
surjectivity, 330, 333, 337, 401 
Reduction 
additive, 171, 287, 381, 388, 399, 403 
bad, 321 
good, 287, 321, 329, 381, 383, 388 
multiplicative, 171, 287, 369, 381, 388, 
403 
non-singular points, 362, 379 
non-split multiplicative, 399 
of a scheme, 300 
of an isogeny, 124 
of Weierstrass equation, 362, 378 
ordinary, 179 
semi-stable, 388 
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split multiplicative, 399 
Reduction type, 352, 353, 363, 364 
over 3-adic field, 406, 407 
Regular 
arithmetic surface, 311 
differential form, 27 
in codimension one, 311 
Regular arithmetic surface, 316, 317, 
351, 370, 399 
non-singular point on fiber, 315, 351, 
399 
smooth part, 316, 318, 321, 325, 332, 
335, 361, 362, 369, 378, 399, 400 
Weierstrass equation, 399 
Regular local ring, 302, 370 
localization is regular, 397 
of dimension one, 328 
of dimension two, 315 
Regular model, minimal proper, 325, 
332, 335, 337, 400 
Regular point, 302 
on a scheme, 397 
Regular scheme, 302, 398, 399 
example, 313 
intuitive definition, 301 
of dimension one, 302 
smooth part, 399 
Weierstrass equation, 399 
Regulator, height, 273 
Reimann conditions, 196, 198 
Relative differentials, 306 
Relative dimension of a smooth mor- 
phism, 305 
Relatively minimal fibered surface, 243 
Relatively minimal surface, 235 
Representation 
irreducible, 405 
of type Eg, 189 
Residue symbol, 6th-power, 177 
Resolution of singularities, 235 
Restricted product, 119 
Resultant, 458 
Riemann ¢ function, 25, 55, 171 
functional equation, 83 
special values, 26, 33, 57 
Riemann hypothesis for varieties over 
finite fields, 61, 81 
Riemann surface, 14, 19, 23, 27 
See also Curve 
differential 1-forms on, 27, 43 
meromorphic k-forms on, 27 
Riemann-Roch theorem, 28, 30, 32, 87, 
198, 226, 238, 345, 430 
Rigid analysis, 430 
Rigidity lemma, 296, 396 
Ring of integers of a quadratic imaginary 
field, 85 
Ring 
Krull dimension of, 301 
of correspondences, 70 
regular local, 302 
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R-morphism, 297 
Rohrlich, D., 485 
Root lattice of type E,, 254 
Root of unity 
primitive, 21, 23, 29, 154 
group scheme of, 397, 398 
Rosen, M., 385, 485 
R-scheme, 297 
generic fiber, 300 
reduction modulo p, 300 
set of R-valued points, 298 
Ruled surface, 236 
R-valued points, 298 
of an affine scheme, 298 


sx(q), 410 
alternative form, 448 
si(q), 411, 425 
Sn, 72, 143 
order of, 72 
$*, 181 
order of, 181 
o See Weierstrass o function 
OR, 55, 61 
Saito, T., 389, 390 
Scheme, 434 
dimension, 302 
excellent, 311 
fiber product, 299 
finite type, 311 
flat, 311 
generic fiber, 300 
generic point, 304 
group, 306 
integral, 311 
nice, 311, 399 
non-singular point, 302 
normal, 311, 328 
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Weierstrass equation, 321, 329, 362, 
369, 378, 399, 400 
Smooth special fiber, 401 
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generated by torsion, 383 
of discrete valuation ring, 399 
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v-adic topology 
multiplication map is continuous, 460 
on a field, 455 
on an elliptic curve, 455, 476 


Valuation 
of a curve on an arithmetic surface, 
311, 313, 339 


on a function field, 194 
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birational equivalence of, 204, 205 
category up to birational equivalence, 
205 
curve through two points, 228, 281 
Frobenius map of, 132 
group, 291 
group of divisors, 255 
height associated to a morphism, 258 
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Weierstrass equation 
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algorithm to find minimal, 364, 403 
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group scheme, 321, 329, 362, 369, 378, 
399, 400 
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362, 378 
minimal, 321, 403, 478, 480 
for p > 3, 406 
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reduction modulo p, 362, 378 
scheme defined by, 321, 329, 362, 369, 
378, 399, 400 
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378, 399, 400 
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464 
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factorization by o, 45, 63, 90, 412, 464, 
467 
Fourier expansion, 47, 50, 409 
Laurent series, 39 
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